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RELATED APPLICATIONS 

This application is a continuation-in-part of International Application No. 
PCT/US03/34681, which designated the United States and was filed on October 31, 
2003, published in English, which claims priority to U.S. Provisional Application No. 
60/477,1 1 1 filed June 9, 2003, and to U.S. Provisional Application No. 60/449,945, 
filed on February 25, 2003, and also to U.S. Provisional Application No. 60/423,545, 
filed on November 1, 2002, the entire contents of all applications are incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 

Diabetes mellitus, a metabolic disease in which carbohydrate utilization is 
reduced and lipid and protein utilization is enhanced, is caused by an absolute or 
relative deficiency of insulin. In the more severe cases, diabetes is characterized by 
chronic hyperglycemia, glycosuria, water and electrolyte loss, ketoacidosis and coma. 
Long term complications include development of neuropathy, retinopathy, 
nephropathy, generalized degenerative changes in large and small blood vessels and 
increased susceptibility to infection. The most common form of diabetes is Type II, 
non-insulin-dependent diabetes that is characterized by hyperglycemia due to 
impaired insulin secretion and insulin resistance in target tissues. Both genetic and 
environmental factors contribute to the disease. For example, obesity plays a major 
role in the development of the disease. Type II diabetes is often a mild form of 
diabetes mellitus of gradual onset. 
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The health implications of Type II diabetes are enormous. In 1995, there were 
135 million adults with diabetes worldwide. It is estimated that close to 300 million 
will have diabetes in the year 2025. (King H., et aL, Diabetes Care, 21(9): 1414-1431 
(1998)). The prevalence of Type II diabetes in the adult population in Iceland is 2.5% 
5 (Vilbergsson, S., et aL, Diabet. Med,, 14(6): 491-498 (1997)), which comprises 
approximately 5,000 people over the age of 34 who have the disease. The high 
prevalence of the disease and increasing population affected shows an unmet medical 
need to define the genetic factors involved in Type II diabetes to more precisely 
define the associated risk factors. Also needed are therapeutic agents for prevention 
10 of Type II diabetes . 

SUMMARY OF THE INVENTION 

As described herein, a locus on chromosome 5q35 has been demonstrated 
which plays a major role in Type II diabetes. The locus, referred to as the Type II 

15 diabetes locus, comprises a nucleic acid that encodes, KChlPl . 

The present invention relates to genes located within the Type II diabetes - 
related locus, particularly nucleic acids comprising the KChlPl gene, and the amino 
acids encoded by these nucleic acids. The invention further relates to pathway 
targeting for drug delivery and diagnosis in identifying those who have Type II 

20 diabetes and those at risk of developing Type II diabetes. Also described are 

haplotypes and SNPs that can be used to identify individuals with Type II diabetes or 
at risk of developing Type II diabetes, particularly in those that are non-obese. As a 
consequence, intervention can be prescribed to these individuals before symptoms of 
the disease present, e.g., dietary changes, exercise and/or medication. Identification 

25 of genes in the Type II diabetes locus can pave the way for a better understanding of 
the disease process, which in turn can lead to improved diagnostics and therapeutics. 

The present invention pertains to methods of diagnosing a susceptibility to 
Type II diabetes in an individual, comprising detecting a polymorphism in a KChlPl 
nucleic acid, wherein the presence of the polymorphism in the nucleic acid is 

30 indicative of a susceptibility to Type II diabetes. The invention additionally pertains 
to methods of diagnosing Type II diabetes in an individual, comprising detecting a 



polymorphism in a KChlPl nucleic acid, wherein the presence of the polymorphism 
in the nucleic acid is indicative of Type II diabetes. In one embodiment, in 
diagnosing Type II diabetes or susceptibility to Type II diabetes by detecting the 
presence of a polymorphism in a KChlPl nucleic acid, the presence of the 
polymorphism in the KChlPl nucleic acid can be indicated, for example, by the 
presence of one or more of the polymorphisms indicated in Table 10. 

In other embodiments, the invention relates to methods of diagnosing a 
susceptibility to Type II diabetes in an individual, comprising detecting an alteration 
in the expression or composition of a polypeptide encoded by a KChlPl nucleic acid 
in a test sample, in comparison with the expression or composition of a polypeptide 
encoded by a KChlPl nucleic acid in a control sample, wherein the presence of an 
alteration in expression or composition of the polypeptide in the test sample is 
indicative of a susceptibility to Type II diabetes. The invention additionally relates to 
a method of diagnosing Type II diabetes in an individual, comprising detecting an 
alteration in the expression or composition of a polypeptide encoded by a KChlPl 
nucleic acid in a test sample, in comparison with the expression or composition of a 
polypeptide encoded by KChlPl nucleic acid in a control sample, wherein the 
presence of an alteration in expression or composition of the polypeptide in the test 
sample is indicative of Type II diabetes. 

The invention also relates to an isolated nucleic acid molecule comprising a 
KChlPl nucleic acid (e.g., SEQ ID NO: 1 or the complement of SEQ ID NO:l). In 
certain embodiments, the KChlPl nucleic acid comprises one or more nucleotide 
sequence(s) selected from the group of nucleic acid sequences as shown in Table 10 
(e.g., SEQ ID NOs: 1 14-258) and the complements of the group of nucleic acid 
sequences as shown in Table 10. For example, in certain embodiments, the nucleotide 
sequence contains one or more polymorphism(s), such as those shown in Table 10. In 
another embodiment, the invention relates to an isolated nucleic acid molecule which 
hybridizes under high stringency conditions to a nucleotide sequence selected from 
the group of SEQ ID NO: 1 and the complement of SEQ ID NO: 1 . In certain 
embodiments, the isolated nucleic acid molecule hybridizes under high stringency 
conditions to a nucleotide sequence comprising one or more nucleotide sequence(s) 
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selected from the group of nucleic acid sequences as shown in Table 10 {e.g., SEQ ID 
NOs: 1 14-258) and the complements of the group of nucleic acid sequences as shown 
in Table 10. For example, in certain embodiments, the nucleotide sequence contains 
one or more polymorphism(s), such as those shown in Table 10. 
5 Also contemplated by the invention is a method of assaying for the presence 

of a first nucleic acid molecule in a sample, comprising contacting said sample with a 
second nucleic acid molecule, where the second nucleic acid molecule comprises at 
least one (or more) nucleic acid sequence(s) selected from the group of SEQ ID NOs: 
1 and 1 14-258, inclusive, wherein the nucleic acid sequence hybridizes to the first 
10 nucleic acid under high stringency conditions. In certain embodiments, the second 
nucleic acid molecule contains one or more polymorphism(s), such as those shown in 
Table 10. 

The invention also relates to a vector comprising an isolated nucleic acid 
molecule of the invention {e.g., SEQ ID NOs: 1 and 1 14-258; optionally including 

15 one or more of the polymorphisms shown in Table 10) operably linked to a regulatory 
sequence, as well as to a recombinant host cell comprising the vector. The invention 
also provides a method for producing a polypeptide encoded by an isolated nucleic 
acid molecule having a polymorphism, comprising culturing the recombinant host cell 
under conditions suitable for expression of the nucleic acid molecule. 

20 Also contemplated by the invention is a method of assaying for the presence 

of a polypeptide encoded by an isolated nucleic acid molecule of the invention in a 
sample, the method comprising contacting the sample with an antibody that 
specifically binds to the encoded polypeptide. 

The invention further pertains to a method of identifying an agent that alters 

25 expression of a KChlPl nucleic acid, comprising: contacting a solution containing a 
nucleic acid comprising the promoter region of the KChlPl gene operably linked to a 
reporter gene, with an agent to be tested; assessing the level of expression of the 
reporter gene in the presence of the agent; and comparing the level of expression of 
the reporter gene in the presence of the agent with a level of expression of the reporter 

30 gene in the absence of the agent; wherein if the level of expression of the reporter 
gene in the presence of the agent differs, by an amount that is statistically significant, 



from the level of expression in the absence of the agent, then the agent is an agent that 
alters expression of the KChlPl gene or nucleic acid. An agent identified by this 
method is also contemplated. 

The invention additionally comprises a method of identifying an agent that 
alters expression of a KChlPl nucleic acid, comprising contacting a solution 
containing a nucleic acid of the invention or a derivative or fragment thereof, with an 
agent to be tested; comparing expression of the nucleic acid, derivative or fragment in 
the presence of the agent with expression of the nucleic acid, derivative or fragment in 
the absence of the agent; wherein if expression of the nucleic acid, derivative or 
fragment in the presence of the agent differs, by an amount that is statistically 
significant, from the expression in the absence of the agent, then the agent is an agent 
that alters expression of the KChlPl nucleic acid. In certain embodiments, the 
expression of the nucleic acid, derivative or fragment in the presence of the agent 
comprises expression of one or more splicing variants(s) that differ in kind or in 
quantity from the expression of one or more splicing variant(s) the absence of the 
agent. Agents identified by this method are also contemplated. 

Representative agents that alter expression of a KChlPl nucleic acid 
contemplated by the invention include, for example, antisense nucleic acids to a 
KChlPl gene or nucleic acid; a KChlPl gene or nucleic acid; a KChlPl polypeptide; 
a KChlPl gene or nucleic acid receptor, or other receptor; a KChlPl binding agent; a 
peptidomimetic; a fusion protein; a prodrug thereof; an antibody; and a ribozyme. A 
method of altering expression of a KChlPl nucleic acid, comprising contacting a cell 
containing a nucleic acid with such an agent is also contemplated. 

The invention further pertains to a method of identifying a polypeptide which 
interacts with a KChlPl polypeptide (e.g., a KChlPl polypeptide encoded by a 
nucleic acid of the invention, such as a nucleic acid comprising one or more 
polymorphism(s) indicated in Table 10), comprising employing a yeast two-hybrid 
system using a first vector which comprises a nucleic acid encoding a DNA binding 
domain and a KChlPl polypeptide, splicing variant, or a fragment or derivative 
thereof, and a second vector which comprises a nucleic acid encoding a transcription 
activation domain and a nucleic acid encoding a test polypeptide. If transcriptional 
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activation occurs in the yeast two-hybrid system, the test polypeptide is a polypeptide, 
which interacts with a KChlPl polypeptide. 

In certain methods of the invention, a Type II diabetes therapeutic agent is 
used. The Type II diabetes therapeutic agent can be an agent that alters (e.g., 
5 enhances or inhibits) KChlPl polypeptide activity and/or KChlPl nucleic acid 
expression, as described herein {e.g., a nucleic acid agonist or antagonist). 

Type II diabetes therapeutic agents can alter polypeptide activity or nucleic 
acid expression of a KChlPl nucleic acid by a variety of means, such as, for example, 
by providing additional polypeptide or upregulating the transcription or translation of 

10 the nucleic acid encoding the KChlPl polypeptide; by altering posttranslational 
processing of the KChlPl polypeptide; by altering transcription of splicing variants; 
or by interfering with polypeptide activity {e.g., by binding to the KChlPl 
polypeptide, or by binding to another polypeptide that interacts with KChlPl, such as 
a KChlPl binding agent as described herein), by altering {e.g., downregulating) the 

15 expression, transcription or translation of a nucleic acid encoding KChlPl ; or by 
altering interaction among KChlPl and a KChlPl binding agent. 

In a further embodiment, the invention relates to Type II diabetes therapeutic 
agent, such as an agent selected from the group consisting of: a KChlPl nucleic acid 
or fragment or derivative thereof; a polypeptide encoded by a KChlPl nucleic acid 

20 (e.g., encoded by a KChlPl nucleic acid having one or more polymorphism(s) such as 
those set forth in Table 10); a KChlPl receptor; a KChlPl binding agent; a 
peptidomimetic; a fusion protein; a prodrug; an antibody; an agent that alters KChlPl 
gene or nucleic acid expression; an agent that alters activity of a polypeptide encoded 
by a KChlPl gene or nucleic acid; an agent that alters posttranscriptional processing 

25 of a polypeptide encoded by a KChlPl gene or nucleic acid; an agent that alters 
interaction of a KChlPl polypeptide with a KChlPl binding agent or receptor; an 
agent that alters transcription of splicing variants encoded by a KChlPl gene or 
nucleic acid; and ribozymes. The invention also relates to pharmaceutical 
compositions comprising at least one Type II diabetes therapeutic agent as described 

30 herein. 
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The invention also pertains to a method of treating a disease or condition 
associated with a KChlPl polypeptide (e.g., Type II diabetes) in an individual, 
comprising administering a Type II diabetes therapeutic agent to the individual, in a 
therapeutically effective amount. In certain embodiments, the Type II diabetes 
5 therapeutic agent is a KChlPl agonist; in other embodiments, the Type II diabetes 
therapeutic agent is a KChlPl antagonist. The invention additionally pertains to use 
of a Type II diabetes therapeutic agent as described herein, for the manufacture of a 
medicament for use in the treatment of Type II diabetes, such as by the methods 
described herein. 

10 A transgenic animal comprising a nucleic acid selected from the group 

consisting of: an exogenous KChlPl gene or nucleic acid and a nucleic acid encoding 
a KChlPl polypeptide, is further contemplated by the invention. 

In yet another embodiment, the invention relates to a method for assaying a 
sample for the presence of a KChlPl nucleic acid, comprising contacting the sample 

15 with a nucleic acid comprising a contiguous nucleotide sequence which is at least 
partially complementary to a part of the sequence of said KChlPl nucleic acid under 
conditions appropriate for hybridization, and assessing whether hybridization has 
occurred between a KChlPl nucleic acid and said nucleic acid comprising a 
contiguous nucleotide sequence which is at least partially complementary to a part of 

20 the sequence of said KChlPl nucleic acid; wherein if hybridization has occurred, a 
KChlPl nucleic acid is present in sample. In certain embodiments, the contiguous 
nucleotide sequence is completely complementary to a part of the sequence of said 
KChlPl nucleic acid. If desired, amplification of at least part of said KChlPl nucleic 
acid can be performed. 

25 In certain other embodiments, the contiguous nucleotide sequence is 100 or 

fewer nucleotides in length and is either at least 80% identical to a contiguous 
sequence of nucleotides of one or more of SEQ ID NOs: 1 and 1 14-258; at least 80% 
identical to the complement of a contiguous sequence of nucleotides of one or more of 
SEQ ID NOs: 1 and 1 14-258; or capable of selectively hybridizing to said KChlPl 

30 nucleic acid. 
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In other embodiments, the invention relates to a reagent for assaying a sample 
for the presence of a KChlPl gene or nucleic acid, the reagent comprising a 
contiguous nucleotide sequence which is at least partially complementary to a part of 
the nucleic acid sequence of said KChlPl gene or nucleic acid; or comprising a 
5 contiguous nucleotide sequence which is completely complementary to a part of the 
nucleic acid sequence of said KChlPl gene or nucleic acid. Also contemplated by the 
invention is a reagent kit, e.g., for assaying a sample for the presence of a KChlPl 
nucleic acid, comprising (e.g., in separate containers) one or more labeled nucleic 
acids comprising a contiguous nucleotide sequence which is at least partially 

10 complementary to a part of the nucleic acid sequence of the KChlPl nucleic acid, and 
reagents for detection of said label. In certain embodiments, the labeled nucleic acid 
comprises a contiguous nucleotide sequence that is completely complementary to a 
part of the nucleotide sequence of said KChlPl gene or nucleic acid. In other 
embodiments, the labeled nucleic acid can comprise a contiguous nucleotide sequence 

15 which is at least partially complementary to a part of the nucleotide sequence of said 
KChlPl gene or nucleic acid, and which is capable of acting as a primer for said 
KChlPl nucleic acid when maintained under conditions for primer extension. 

The invention also provides for the use of a nucleic acid which is 100 or fewer 
nucleotides in length and which is either: a) at least 80% identical to a contiguous 

20 sequence of nucleotides of one or more of SEQ ID NOs: 1 and 1 14-258; b) at least 
80% identical to the complement of a contiguous sequence of nucleotides of one or 
more of SEQ ID NOs: 1 and 1 14-258; or c) capable of selectively hybridizing to said 
KChlPl nucleic acid, for assaying a sample for the presence of a KChlPl nucleic 
acid. 

25 In yet another embodiment, the use of a first nucleic acid which is 100 or 

fewer nucleotides in length and which is either: a) at least 80% identical to a 
contiguous sequence of nucleotides of one or more of SEQ ID NOs: 1 and 1 14-258; 
b) at least 80% identical to the complement of a contiguous sequence of nucleotides 
of one or more of SEQ ID NOs: 1 and 1 14-258; or c) capable of selectively 

30 hybridizing to said KChlPl nucleic acid; for assaying a sample for the presence of a 
KChlPl gene or nucleic acid that has at least one nucleotide difference from the first 
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nucleic acid (e.g., a SNP as set forth in Table 10), such as for diagnosing a 
susceptibility to a disease or condition associated with a KChlPl. 

The invention also relates to a method of diagnosing Type II diabetes or a 
susceptibility to Type II diabetes in an individual, comprising determining the 
5 presence or absence in the individual of certain "haplotypes" (combinations of genetic 
markers). In one aspect of the invention of diagnosising a susceptibility of the 
disease, methods are described comprising screening for one of the at-risk haplotypes 
in the KChlPl gene that is more frequently present in an individual susceptible to 
Type II diabetes, compared to the frequency of its presence in the general population, 

10 wherein the presence of an at-risk haplotype is indicative of a susceptibility to Type II 
diabetes. An "at-risk haplotype" is intended to embrace one or a combination of 
haplotypes described herein over the KChlPl gene that show high correlation to Type 
II diabetes. In one embodiment, the at-risk haplotype is characterized by the presence 
of at least one single nucleotide polymorphisms as described in Table 13. In one 

15 embodiment, a haplotype associated with Type II diabetes or a susceptibility to Type 
II diabetes comprises one or more haplotypes identified in Table 2 (haplotypes 
identified as Al, A2, A3, A4, A5, A6, Bl, B2, B3, B4 and B5), Table 4 (haplotypes 
identified as Dl and D2), Table 5 (haplotypes identified as D2, D3, D4, D5 and D6) 
or Table 14 (haplotypes identified as Hap E and Hap E'). In certain embodiments, a 

20 haplotype associated with Type II diabetes or a susceptibility to Type II diabetes 
comprises markers DG5S879, DG5S881, D5S2075, DG5S883 and DG5S38 at the 
5q35 locus; or DG5S1058 and DG5S37 at the 5q35 locus; or DG5S1058, DG5S37 
and DG5S101 at the 5q35 locus; or DG5S881, DG5S1058, D5S2075, DG5S883 and 
DG5S38 at the 5q35 locus; or DG5S879, DG5S1058 and DG5S37; or DG5S881, 

25 D5S2075, DG5S883 and DG5S38 at the 5q35 locus; DG5S953, DG5S955, DG5S13 
and DG5S959 at the 5q35 locus; or DG5S888 and DG5S953 at the 5q35 locus; or 
DG5S953, DG5S955 and DG5S124 at the 5q35 locus; or DG5S888, DG5S44 and 
DG5S953 at the 5q35 locus; or DG5S953, DG5S955, DG5S13, DG5S123, and 
DG5S959 at the 5q35 locus. The presence of the haplotype is diagnostic of Type II 

30 diabetes or of a susceptibility to Type II diabetes. Also described herein is a 
haplotype associated with Type II diabetes or a susceptibility to Type II diabetes 
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comprising markers DG5S13, KCPJ 152, and D5S625 at the 5q35 locus; the 
presence of the haplotype is diagnostic of Type II diabetes or of a susceptibility to 
Type II diabetes. In one particular embodiment, the presence of the -4, 1, 0 haplotype 
at DG5S13, KCP_1 152, and D5S625 is diagnostic of Type II diabetes or of a 
5 susceptibility to Type II diabetes. In another embodiment, a haplotype associated 
with Type II diabetes or a susceptibility to Type II diabetes in an individual, 
comprises markers DG5S124, KCP_1 152, KCP_2649, KPCJ976 and KPC-16152 at 
the 5q35 locus. In one particular embodiment, the presence of the 0, 1, 1, 3 and 0 
haplotype at DG5S124, KCP_1 152, KCP_2649, KPC J976 and KPC-16152 is 

10 diagnostic of Type II diabetes or of a susceptibility to Type II diabetes. In another 
embodiment, a haplotype associated with Type II diabetes or a susceptibility to Type 
II diabetes in an individual, comprises markers KCPJ 73982, KCPJ 5400, and 
KCP_18069. In one particular embodiment, the presence of the 0, 1, 1 haplotype at 
KCP_1 73982, KCPJ 5400, and KCPJ 8069 is diagnostic of Type II diabetes or of a 

15 susceptibility to Type II diabetes. 

In additional embodiments, a haplotype associated with Type II diabetes or a 
susceptibility to Type II diabetes comprises markers DG5S124, KCP_1 152, 
KCP_2649, KCP_4976, and KCPJ6152 at the 5q35 locus, as well as one of the 
following 3 markers: KCPJ 97678, KCPJ 97775, and KCP_202795 at the 5q35 

20 locus; the presence of the haplotype is diagnostic of Type II diabetes or of a 

susceptibility to Type II diabetes. In particular embodiments, the presence of the 0, 3, 
1, 1, 3, 0 haplotype at DG5S124, KCPJ97678, KCPJ 152, KCPJ649, KCPJ976, 
and KCP_16152; the presence of the 0, 3, 1, 1, 3, 0 haplotype at DG5S124, 
KCPJ97775, KCPJ 152, KCP J649, KCP _4976, and KCPJ6152; or the presence 

25 of the 0, 1, 1, 1, 3, 0 haplotype at DG5S124, KCP_202795, KCPJ 152, KCPJ649, 
KCPJ976, and KCPJ6152; is diagnostic of Type II diabetes or of a susceptibility to 
Type II diabetes. 

In additional embodiments, a haplotype associated with Type II diabetes or a 
susceptibility to Type II diabetes comprises markers rsl 032856, KCPJIS888934, 
30 KCP_93545, KCPJ02882, 169234, KCPJ 86048 and KCPJ6152, as well as 
markers rsl032856, KCP_RS888934, KCP_93545, KCPJ 02882, 169234, 
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KCP_1 86048, KCP_1 97775 and KCPJ6152 at the 5q35 locus; the presence of the 
haplotype is diagnostic of Type II diabetes or of a susceptibility to Type II diabetes. 
In particular embodiments, the presence of the G, G, T, C, G, G, A haplotype at 
rsl032856, KCP_RS888934, KCP_93545, KCPJ02882, 169234, KCP_1 86048 and 
5 KCPJ6152, or the presence of the G, G, T, C, G, G, C, A haplotype at rsl032856, 
KCP_RS888934, KCP_93545, KCP_102882, 169234, KCPJ86048, KCP_197775 
and KCP_16152 is diagnostic of Type II diabetes or of a susceptibility to Type II 
diabetes. 

The presence or absence of the haplotype can be determined by various 

10 methods, including, for example, using enzymatic amplification of nucleic acid from 
the individual, electrophoretic analysis, restriction fragment length polymorphism 
analysis and/or sequence analysis. 

Also described herein is a method of diagnosing Type II diabetes in an 
individual, comprising determining the presence or absence in the individual of a 

15 haplotype comprising one or more markers and/or single nucleotide polymorphisms 
as shown in Table 10, Table 2, Table 4, Table 5, Table 13 and/or Table 14 in the locus 
on chromosome 5q35, wherein the presence of the haplotype is diagnostic of Type II 
diabetes. Also contemplated is a method of diagnosing a susceptibility to Type II 
diabetes in an individual, comprising determining the presence or absence in the 

20 individual of a haplotype comprising one or more markers and/or single nucleotide 
polymorphisms as shown in Table 10, Table 13 and/or Table 14 in the locus on 
chromosome 5q35, wherein the presence of the haplotype is diagnostic of a 
susceptibility to Type II diabetes. 

A method for the diagnosis and identification of a susceptibility to Type II 

25 diabetes in an individual is also described, comprising: screening for an at-risk 

haplotype in the KChlPl nucleic acid that is more frequently present in an individual 
susceptible to Type II diabetes compared to an individual who is not susceptible to 
Type II diabetes, wherein the at-risk haplotype increases the risk significantly. In 
certain embodiments, the significant increase is at least about 20% or the significant 

30 increase is identified as an odds ratio of at least about 1 .2. 
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In another embodiment, the invention features a method of diagnosing a 
predisposition or susceptibility to Type II diabetes in a subject, comprising detecting 
the presence or absence of a genetic marker associated with the KChlPl gene, the 
marker having a p-value of lxl 0" 5 or less, wherein the presence of the marker 
5 associated with the KChlPl gene is indicative of a predisposition or susceptibility to 
Type II diabetes. 

In another embodiment, the invention features a method of diagnosing a 
predisposition or susceptibility to an Type II diabetes associated condition in a 
subject, comprising detecting the presence or absence of a genetic marker associated 

10 with the KChlPl gene, the marker having a p-value of lxlO" 5 or less, wherein the 
presence of the marker associated with the KChlPl gene is indicative of a 
predisposition or susceptibility to an Type II diabetes associated condition. 

In other embodiments, the at-risk haplotype has a relative risk of at least 1.5, 
at least 2.5 or at least 3.0. In other embodiments, the at-risk haplotype associated with 

15 the TXNIPH gene has a p-value of lxlO' 5 or less, lxlO' 6 or less, 1x10 7 or less or 
lxl 0* 8 or less. 

A major application of the current invention involves prediction of those at 
higher risk of developing a Type II diabetes. Diagnostic tests that define genetic 
factors contributing to Type II diabetes might be used together with or independent of 

20 the known clinical risk factors to define an individual's risk relative to the general 
population. Better means for identifying those individuals at risk for Type II diabetes 
should lead to better prophylactic and treatment regimens, including more aggressive 
management of the current clinical risk factors. 

Another application of the current invention is the specific identification of a 

25 rate-limiting pathway involved in Type II diabetes. A disease gene with genetic 
variation that is significantly more common in diabetic patients as compared to 
controls represents a specifically validated causative step in the pathogenesis of Type 
II diabetes. That is, the uncertainty about whether a gene is causative or simply 
reactive to the disease process is eliminated. The protein encoded by the disease gene 

30 defines a rate-limiting molecular pathway involved in the biological process of Type 
II diabetes predisposition. The proteins encoded by such Type II genes or its 
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interacting proteins in its molecular pathway may represent drug targets that may be 
selectively modulated by small molecule, protein, antibody, or nucleic acid therapies. 
Such specific information is greatly needed since the population affected with Type II 
diabetes is growing. 

5 A third application of the current invention is its use to predict an individual's 

response to a particular drug, even drugs that do not act on KChlPl or its pathway. It 
is a well-known phenomenon that in general, patients do not respond equally to the 
same drug. Much of the differences in drug response to a given drug is thought to be 
based on genetic and protein differences among individuals in certain genes and their 

10 corresponding pathways. Our invention defines the association of KChlPl with Type 
II diabetes. Some current or future therapeutic agents may be able to affect this gene 
directly or indirectly and therefore, be effective in those patients whose Type II 
diabetes risk is in part determined by the KChlPl genetic variation. On the other 
hand, those same drugs may be less effective or ineffective in those patients who do 

15 not have at risk variation in the KChlPl gene. Therefore, KChlPl variation or 
haplotypes may be used as a pharmacogenomic diagnostic to predict drug response 
and guide choice of therapeutic agent in a given individual. 

BRIEF DESCRIPTION OF THE DRAWINGS 
20 The foregoing and other objects, features and advantages of the invention will 

be apparent from the following more particular description of preferred embodiments 

of the invention, as illustrated in the accompanying drawings. 

FIG.1.1 through 1.148 show the KChlPl genomic DNA (SEQ ID NO: 1). 

This sequence is taken from NCBI Build 33. The numbering in FIG. 1, as well as the 
25 "start" and "end" numbers in all Tables refer to the location in Chromosome 5 in 

NCBI Build 33. The numbering in FIG. 1 refers to the last base in the line 

immediately preceding the number; the numbers are in decreasing order because of 

the "reverse orientation" of the gene. 

FIG. 2 shows the amino acid sequence of KChlPl as published by An et al 
30 Nature, 403(6768): 553-6 (2000) (SEQ ID NO: 2). 
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FIG. 3 shows the nucleic acid sequence (SEQ ID NO: 3) encoding the amino 
acid sequence of KChlPl as published by An et al , Nature, 403(6768): 553-6 (2000) 
(SEQ ID NO: 2). 

FIG. 4 is a series of graphs showing the results of a genome-wide scan using 
5 906 microsatellite markers. Results are shown for three phenotypes: all Type II 
diabetics (solid lines), obese Type II diabetics (dotted lines) and non-obese Type II 
diabetics (dashed lines). The multipoint allele-sharing LOD-score is on the vertical 
axis, and the centimorgan distance from the P-terminus of the chromosome is on the 
horizontal axis. 

10 FIG. 5 graphically depicts the multipoint allele-sharing LOD-score of the 

locus on chromosome 5 after 38 microsatellite markers have been added to the 
framework set in a 40-cM interval, from 160 cM to 200 cM. Results are shown for 
the same three phenotypes as in FIG. 4; all Type II diabetics (solid line), non-obese 
Type II diabetics (dashed line) and obese Type II diabetics (dotted line) the results of 

15 a genome- wide scan using 906 microsatellite markers. 

FIG. 6 graphically depicts the single-marker and haplotype association within 
the 1-LOD-drop for 590 non-obese diabetics vs 477 unrelated population controls. 
The location of the markers and haplotypes is on the horizontal axis and the 
corresponding two-sided P-value on the vertical axis. All haplotypes with a P-value 

20 less than 0.01 are shown. The horizontal bars indicate the span of the corresponding 
haplotypes and the marker density is shown at the bottom of the figure. All locations 
refer to NCBI Build 33 and the 1-LOD-drop spans from 167.64 to 171.28 Mb. 

FIG. 7 schematically shows the location of genes and markers in region B. 
The microsatellites used in the locus-wide association study are shown as filled circles 

25 at the top. The filled boxes indicate the locations of exons, or clusters of exons, for 
KCHIP1 . The shaded boxes indicated the location and size of the neighboring genes, 
LCP2, KCNMB1, GABRP and RANBP17, and the grey horizontal lines indicate the 
span of the five most significant microsatellite haplotypes in the region. 



30 
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DETAILED DESCRIPTION OF THE INVENTION 

Extensive genealogical information for a population with population-based 
lists of patients with Type II diabetes has been combined with powerful gene sharing 
methods to map a locus on chromosome 5q35. Diabetics and their relatives were 
5 genotyped with a genome-wide marker set including 906 microsatellite markers, with 
an average marker density of 4cM. Due to the role obesity plays in the development 
of diabetes, the material was fractionated according to body mass index (BMI). 
Presented herein are results of a genome wide search of genes that cause Type II 
diabetes in Iceland. 

10 

Loci Associated with Diabetes 

Evidence for genes causing the early onset monogenic form of diabetes have 
been previously identified. Mutations in six genes have been discovered that cause 
MODY, or maturity onset diabetes of the young. MODY1 - MODY6 are due to 

15 mutations in HNF4a, glucokinase, HNFla, IPF1, HNFlb and NEUROD1 (MODY1 : 
Yamagata K, et al t Nature 384:458-460 (1996); MODY2: Froguel P, F et al, 
Nature 356: 162-164(1992); MODY3: Yamagata, K., etal, Nature 384: 455-458 
(1996); MODY4: Yoshioka M., et al, Diabetes May;46(5):887-94 (1997) MODY5: 
Horikawa, Y., etal, Nat. Genet. 17: 384-385 (1997) MODY6: Kristinsson S.Y., et 

20 al, Diabetologia Nov:44(l 1):2098-103 (2001)). 

One gene has been identified as a disease gene that contributes to the late- 
onset form of diabetes, the calpain 10 gene (CAPN10). CAPN10, was identified 
though a genome-wide screen of Mexican American sibpairs with diabetes 
(Horikawa, Y., et al, Nat. Genet. 26(2) 163-175(2000)). The risk allele has been 

25 shown to be associated with impaired regulation of glucose-induced secretion and 

decreased rate of insulin-stimulated glucose disposal (Lynn, S., et al, Diabetes, 51(1): 
247-250 (2002); Sreenan, S.K., et al., Diabetes 50(9) 2013-2020 (2001) and Baier, L. 
J., et al, J. Clin. Invest 106(7) R69-73 (2000)). 

Many genome-wide screens in a variety of populations have been performed 

30 that have resulted in major loci for Diabetes. Loci are reported on chromosome 2q37 
(Hanis, C.L., et aL, Nat. Genet, 13(2):161-166 (1996)), chromosome 15q21 (Cox, et 
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al, Nat Genet. 21(2):213-215 (1999)), chromosome 10q26 (Duggirala, R., etal,Am. 
J. Hum. Genet., 68(5): 1 149-1 164 (2001)), chromosome 3p (Ehm, M.G., et al 9 Am. J. 
Hum. Genet., 66(6):1871-1881 (2000)) in Mexican Americans, and chromosomes 
lq21-23 and 1 Iq23-q25 (Hanson R. L. et al, Am J. Hum Genet., 63(4): 1 130-1 138 
5 (1998)) in PIMA Indians. In the Caucasian population, linkages have been observed 
to chromosome 12q24 in Finns (Mahtani, et al, Nat. Genet, 14(l):90-4 (1994)), 
chromosome Iq21-q23 in Americans in Utah (Elbein, S.C., et al, Diabetes, 
48(5):1175-1182 (1999)), chromosome 3q27-pterin French families (Vionnet, N., et 
al, Am. J. Hum. Genet. 67(6): 1470-80 (2000) and chromosome 18pl 1 in 

10 Scandinavians (Parker, A., et al, Diabetes, 50(3) 675-680 (2001)). A recent study 
reported a major locus in indigenous Australians on chromosome 2q24.3 (Busfield, 
F,. et al, Am. J. Hum. Genet, 70(2): 349-357 (2002)). Many other studies have 
resulted in suggestive loci or have replicated these loci. 

Association studies have been reported for Type II diabetes. Most of these 

15 studies show modest association to the disease in a group of people but do not account 
for the disease. Altshuler et al, reviewed the association work that has been done and 
concluded that association to only one of 16 genes revealed held up to scrutiny. 
Altshuler et al, confirmed that the Prol2Ala polymorphism in PPARg is associated 
with Type II diabetes. Until now, there have been no linkage studies in Type II 

20 diabetes linking the disease to chromosome 5q35 

KChlPl 

The invention described herein has linked Type II diabetes to a gene encoding 
Kv channel-interacting protein 1 (KChlPl; also known as KCNIP1). In the brain and 

25 heart, rapidly inactivating (A-type) voltage-gated potassium (Kv) currents operate at 
subthreshold membrane potentials to control the excitability of neurons and cardiac 
myocytes. Although pore-forming alpha-subunits of the Kv4, or Shal-related, channel 
family form A-Type currents in heterologous cells, these differ significantly from 
native A-Type currents. To identify proteins that interacted with the Kv4 subunit, An 

30 et al, ("Modulation of A-Type potassium channels by a family of calcium sensors" 
Nature 403:553-6 (2000)) used the yeast two-hybrid system with the intracellular 
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amino terminus of the rat Kv4.3 subunit to screen rat midbrain cDNA libraries. Two 
Kv channel-interacting proteins were identified and called KChlPs (KChIP-1 and 
KChIP2). Library screening and database mining identified mouse and human 
orthologs of these genes. The KChlPl cDNA encodes a 216-amino acid protein. The 
KChlPs have 4 EF-hand-like domains and bind calcium ions. Both KChlPs have 
distinct N termini but share approximately 70% amino acid identity throughout a 
carboxy-terminal 185-amino acid core domain that contains the 4 EF-hand-like 
motifs. Although the KChlPs have around 40% amino acid similarity to neuronal 
calcium sensor- 1 and are members of the recoverin /NCS subfamily of calcium- 
binding proteins, other members of this subfamily, such as hippocalcin, did not 
interact with Kv4 channels in the yeast 2-hybrid assay. An et al, (supra) additionally 
found that expression of KChlPs and Kv4 together reconstitutes several features of 
native A-Type currents by modulating the density, inactivation kinetics, and rate of 
recovery from inactivation of Kv4 channels in heterologous cells. Both KChlPs 
colocalize and coimmunoprecipitate with brain Kv4 alpha-subunits, and are thus 
integral components of native Kv4 channel complexes. As the activity and density of 
neuronal A-Type currents tightly control responses to excitatory synaptic inputs, these 
KChlPs may regulate A-Type currents, and hence neuronal excitability, in response to 
changes in intracellular calcium. 

The glycosphingolipid sulfatide is present in secretory granules and at the 
surface of pancreatic P-cells (Buschard K, Fredman P. "Sulphatide as an antigen in 
diabetes mellitus". Diabetes NutrMetab 4:221 -228 (1996)), and antisulfatide 
antibodies (ASA; IgGl) are found in serum from the majority of patients with newly 
diagnosed Type I diabetes. Buschard et al, ("Sulfatide controls insulin secretion by 
modulation of ATP-sensitive K(+)-channel activity and Ca(2+)-dependent exocytosis 
in rat pancreatic beta-cells" Diabetes 51:2514-21 (2002)) demonstrated that sulfatide 
produced a glucose- and concentration-dependent inhibition of insulin release from 
isolated rat pancreatic islets. This inhibition of insulin secretion was due to activation 
of ATP-sensitive K + -(K A tp) channels in single rat P-cells. No effect of sulfatide was 
observed on whole-cell Ca 2+ -channel activity or glucose-induced elevation of 
cytoplasmic Ca 2+ concentration. A key observation was that sulfatide stimulated 
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Ca -dependent exocytosis determined by capacitance measurements and depolarized- 
induced insulin secretion from islets exposed to diazoxide and high external KC1. The 
monoclonal sulfatide antibody Sulph I as well as ASA-positive serum reduced 
glucose-induced insulin secretion by inhibition of Ca 2+ -dependent exocytosis. This 
5 suggests that sulfatide is important for the control of glucose-induced insulin secretion 
and that both an increase and a decrease in the sulfatide content have an impact on the 
secretory capacity of the individual P-cells. 

ASSESSMENT FOR AT-RISK HAPLOTYPES 

10 A "haplotype," as described herein, refers to a combination of genetic markers 

("alleles"), such as those set forth in Table 2, Table 4, Table 5 and Table 14. In a 
certain embodiment, the haplotype can comprise one or more alleles, two or more 
alleles, three or more alleles, four or more alleles, or five or more alleles. The genetic 
markers are particular "alleles" at "polymorphic sites" associated with KChPIl . A 

15 nucleotide position at which more than one sequence is possible in a population 
(either a natural population or a synthetic population, e.g., a library of synthetic 
molecules) is referred to herein as a "polymorphic site". Where a polymorphic site is 
a single nucleotide in length, the site is referred to as a single nucleotide 
polymorphism ("SNP"). For example, if at a particular chromosomal location, one 

20 member of a population has an adenine and another member of the population has a 
thymine at the same position, then this position is a polymorphic site, and, more 
specifically, the polymorphic site is a SNP. Polymorphic sites can allow for 
differences in sequences based on substitutions, insertions or deletions. Each version 
of the sequence with respect to the polymorphic site is referred to herein as an "allele" 

25 of the polymorphic site. Thus, in the previous example, the SNP allows for both an 
adenine allele and a thymine allele. 

Typically, a reference sequence is referred to for a particular sequence. 
Alleles that differ from the reference are referred to as "variant" alleles. For example, 
the reference KChPIl sequence is described herein by SEQ ID NO: 1 . The term, 

30 "variant KChPIl ", as used herein, refers to a sequence that differs from SEQ ID NO: 
1 but is otherwise substantially similar. The genetic markers that make up the 



-19- 



haplotypes described herein are KChPIl variants. Additional variants can include 
changes that affect a polypeptide, e.g., the KChPIl polypeptide. These sequence 
differences, when compared to a reference nucleotide sequence, can include the 
insertion or deletion of a single nucleotide, or of more than one nucleotide, resulting 
in a frame shift; the change of at least one nucleotide, resulting in a change in the 
encoded amino acid; the change of at least one nucleotide, resulting in the generation 
of a premature stop codon; the deletion of several nucleotides, resulting in a deletion 
of one or more amino acids encoded by the nucleotides; the insertion of one or several 
nucleotides, such as by unequal recombination or gene conversion, resulting in an 
interruption of the coding sequence of a reading frame; duplication of all or a part of a 
sequence; transposition; or a rearrangement of a nucleotide sequence, as described in 
detail above. Such sequence changes alter the polypeptide encoded by a KChPIl 
nucleic acid. For example, if the change in the nucleic acid sequence causes a frame 
shift, the frame shift can result in a change in the encoded amino acids, and/or can 
result in the generation of a premature stop codon, causing generation of a truncated 
polypeptide. Alternatively, a polymorphism associated with Type II diabetes or a 
susceptibility to Type II diabetes can be a synonymous change in one or more 
nucleotides (i.e., a change that does not result in a change in the amino acid 
sequence). Such a polymorphism can, for example, alter splice sites, affect the 
stability or transport of mRNA, or otherwise affect the transcription or translation of 
the polypeptide. The polypeptide encoded by the reference nucleotide sequence is the 
"reference" polypeptide with a particular reference amino acid sequence, and 
polypeptides encoded by variant alleles are referred to as "variant" polypeptides with 
variant amino acid sequences. 

Haplotypes are a combination of genetic markers, e.g., particular alleles at 
polymorphic sites. The haplotypes described herein, e.g., having markers such as 
those shown in Table 6, Table 7, Table 9, Table 11, Table 12, Table 13 and Table 14 
are found more frequently in individuals with Type II diabetes than in individuals 
without Type II diabetes. Therefore, these haplotypes have predictive value for 
detecting Type II diabetes or a susceptibility to Type II diabetes in an individual. The 
haplotypes described herein are a combination of various genetic markers, e.g., SNPs 
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and microsatellites. Therefore, detecting haplotypes can be accomplished by methods 
known in the art for detecting sequences at polymorphic sites, such as the methods 
described above. 

In certain methods described herein, an individual who is at risk for Type II 
diabetes is an individual in whom an at-risk haplotype is identified. In one 
embodiment, the at-risk haplotype is one that confers a significant risk of Type II 
diabetes. In one embodiment, significance associated with a haplotype is measured 
by an odds ratio. In a further embodiment, the significance is measured by a 
percentage. In one embodiment, a significant risk is measured as an odds ratio of at 
least about 1.2, including but not limited to: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8 and 1.9. In 
a further embodiment, an odds ratio of at least 1 .2 is significant. In a further 
embodiment, an odds ratio of at least about 1 .5 is significant. In a further 
embodiment, a significant increase in risk is at least about 1.7 is significant. In a 
further embodiment, a significant increase in risk is at least about 20%, including but 
not limited to about 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 
80%, 85%, 90%, 95% and 98%. In a further embodiment, a significant increase in 
risk is at least about 50%. It is understood however, that identifying whether a risk is 
medically significant may also depend on a variety of factors, including the specific 
disease, the haplotype, and often, environmental factors. 

An at-risk haplotype in, or comprising portions of, the KChPIl gene, is one 
where the haplotype is more frequently present in an individual at risk for Type II 
diabetes (affected), compared to the frequency of its presence in a healthy individual 
(control), and wherein the presence of the haplotype is indicative of Type II diabetes 
or susceptibility to Type II diabetes. 

Standard techniques for genotyping for the presence of SNPs and/or 
microsatellite markers can be used, such as fluorescent-based techniques (Chen, et al 9 
Genome Res. P, 492 (1999)), PCR, LCR, Nested PCR and other techniques for nucleic 
acid amplification. In one embodiment, the method comprises assessing in an 
individual the presence or frequency of SNPs and/or microsatellites in, comprising 
portions of, the KChlPlgene, wherein an excess or higher frequency of the SNPs 
and/or microsatellites compared to a healthy control individual is indicative that the 
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individual has Type II diabetes, or is susceptible to Type II diabetes. See, for 
example, Table 6, Table 7, Table 9, Table 11, Table 12 and 13 (below) for SNPs and 
markers that can form haplotypes that can be used as screening tools. These markers 
and SNPs can be identified in at-risk haploptypes. For example, an at-risk haplotype 
5 can include microsatellite markers and/or SNPs such as those set forth in Table 2, 
Table 4, Table 5 and Table 14. The presence of the haplotype is indicative a 
susceptibility to Type II diabetes, and therefore is indicative of an individual who falls 
within a target population for the treatment methods described herein. 

Haplotype analysis involves defining a candidate susceptibility locus using 

10 LOD scores. The defined regions are then ultra-fine mapped with microsatellite 
markers with an average spacing between markers of less than 100 kb. All usable 
microsatellite markers that found in public databases and mapped within that region 
can be used. In addition, microsatellite markers identified within the deCODE 
genetics sequence assembly of the human genome can be used. 

15 The frequencies of haplotypes in the patient and the control groups using an 

expectation-maximization algorithm can be estimated (Dempster A. et al. 9 1977. J. R. 
Stat. Soc. B, 39:1-389). An implementation of this algorithm that can handle missing 
genotypes and uncertainty with the phase can be used. Under the null hypothesis, the 
patients and the controls are assumed to have identical frequencies. Using a 

20 likelihood approach, an alternative hypothesis where a candidate at-risk-haplotype, 
which can include the KChPIl SNPs, is allowed to have a higher frequency in 
patients than controls, while the ratios of the frequencies of other haplotypes are 
assumed to be the same in both groups is tested. Likelihoods are maximized 
separately under both hypotheses and a corresponding 1-df likelihood ratio statistic is 

25 used to evaluate the statistic significance. 

To look for at-risk-haplotypes in the 1-lod drop, for example, association of all 
possible combinations of genotyped markers is studied, provided those markers span a 
practical region. The combined patient and control groups can be randomly divided 
into two sets, equal in size to the original group of patients and controls. The 

30 haplotype analysis is then repeated and the most significant p-value registered is 

determined. This randomization scheme can be repeated, for example, over 100 times 
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to construct an empirical distribution of p-values. In a preferred embodiment, a p- 
value of <0.05 is indicative of an at-risk haplotype. 

A detailed discussion of haplotype analysis follows. 

Haplotype analysis 

Our general approach to haplotype analysis involves using likelihood-based 
inference applied to NEsted MOdels. The method is implemented in our program 
NEMO, which allows for many polymorphic markers, SNPs and microsatellites. The 
method and software are specifically designed for case-control studies where the 
purpose is to identify haplotype groups that confer different risks. It is also a tool for 
studying LD structures. 

When investigating haplotypes constructed from many markers, apart from 
looking at each haplotype individually, meaningful summaries often require putting 
haplotypes into groups. A particular partition of the haplotype space is a model that 
assumes haplotypes within a group have the same risk, while haplotypes in different 
groups can have different risks. Two models/partitions are nested when one, the 
alternative model, is a finer partition compared to the other, the null model, Le, the 
alternative model allows some haplotypes assumed to have the same risk in the null 
model to have different risks. The models are nested in the classical sense that the null 
model is a special case of the alternative model. Hence traditional generalized 
likelihood ratio tests can be used to test the null model against the alternative model. 
Note that, with a multiplicative model, if haplotypes /*, and hj are assumed to have the 
same risk, it corresponds to assuming that f/pi = f/pj where / and p denote haplotype 
frequencies in the affected population and the control population respectively. 

One common way to handle uncertainty in phase and missing genotypes is a 
two-step method of first estimating haplotype counts and then treating the estimated 
counts as the exact counts, a method that can sometimes be problematic (e.g., see the 
information measure section below) and may require randomization to properly 
evaluate statistical significance. In NEMO, maximum likelihood estimates, likelihood 
ratios and p-values are calculated directly, with the aid of the EM algorithm, for the 
observed data treating it as a missing-data problem. 
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NEMO allows complete flexibility for partitions. For example, the first 
haplotype problem described in the Methods section on Statistical analysis considers 
testing whether h\ has the same risk as the other haplotypes h 2 , . . . , h k . Here the 
alternative grouping is [h\], [h 2 , h k ] and the null grouping is [h u h k ]. The 

5 second haplotype problem in the same section involves three haplotypes h\ = GO, h 2 = 
GX and h 3 = AX, and the focus is on comparing h\ and h 2 . The alternative grouping 
is [hi], [h 2 ], [h 3 ] and the null grouping is [h u h 2 ], [h 3 ]. If composite alleles exist, one 
could collapse these alleles into one at the data processing stage, and performed the 
test as described. This is a perfectly valid approach, and indeed, whether we collapse 

10 or not makes no difference if there were no missing information regarding phase. 
But, with the actual data, if each of the alleles making up a composite correlates 
differently with the SNP alleles, this will provide some partial information on phase. 
Collapsing at the data processing stage will unnecessarily increase the amount of 
missing information. A nested-models/partition framework can be used in this 

15 scenario. Let h 2 be split into h 2a , h 2b , . . . ., h 2e , and h 3 be split into h 3a , h 3b , . . ., h 3e . 

Then the alternative grouping is [hi], [h^, h 2b , . . . ., h 2e ], [h 3a , h 3b , . . h 3e ] and the null 
grouping is [h\ 9 h^, h 2b , . . . ., h 2e ], [h 3a , h 3b , . . ., h 3e ]. The same method can be used to 
handle composite where collapsing at the data processing stage is not even an option 
since L c represents multiple haplotypes constructed from multiple SNPs. 

20 Alternatively, a 3-way test with the alternative grouping of [h\] 9 [h 22L , h 2b , ....,h 2e ], 
[h 3a , h 3b , . . ., h 3e ] versus the null grouping of [h u h 2a , h 2b , . . . ., h 2e , h 3a , h 3by . . ., h 3e ] 
could also be performed. Note that the generalized likelihood ratio test-statistic would 
have two degrees of freedom instead of one. 

25 Measuring information 

Even though likelihood ratio tests based on likelihoods computed directly for 
the observed data, which have captured the information loss due to uncertainty in 
phase and missing genotypes, can be relied on to give valid p-values, it would still be 
of interest to know how much information had been lost due to the information being 

30 incomplete. Interestingly, one can measure information loss by considering a two- 
step procedure to evaluating statistical significance that appears natural but happens to 
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be systematically anti-conservative. Suppose we calculate the maximum likelihood 
estimates for the population haplotype frequencies calculated under the alternative 
hypothesis that there are differences between the affected population and control 
population, and use these frequency estimates as estimates of the observed 
5 frequencies of haplotype counts in the affected sample and in the control sample. 
Suppose we then perform a likelihood ratio test treating these estimated haplotype 
counts as though they are the actual counts. We could also perform a Fisher's exact 
test, but we would then need to round off these estimated counts since they are in 
general non-integers. This test will in general be anti-conservative because treating 

10 the estimated counts as if they were exact counts ignores the uncertainty with the 
counts, overestimates the effective sample size and underestimates the sampling 
variation. It means that the chi-square likelihood-ratio test statistic calculated this 
way, denoted by A*, will in general be bigger than A, the likelihood-ratio test-statistic 
calculated directly from the observed data as described in methods. But A* is useful 

15 because the ratio A/A* happens to be a good measure of information, or 1 - (A/A*) is 
a measure of the fraction of information lost due to missing information. This 
information measure for haplotype analysis is described in Nicolae and Kong, 
Technical Report 537, Department of Statistics, University of Statistics, University of 
Chicago, Revised for Biometrics (2003) as a natural extension of information 

20 measures defined for linkage analysis, and is implemented in NEMO. 

Statistical analysis. 

For single marker association to the disease, the Fisher exact test can be used to 
calculate two-sided p-values for each individual allele. All p-values are presented 

25 unadjusted for multiple comparisons unless specifically indicated. The presented 
frequencies (for microsatellites, SNPs and haplotypes) are allelic frequencies as 
opposed to carrier frequencies. To minimize any bias due the relatedness of the 
patients who were recruited as families for the linkage analysis, first and second- 
degree relatives can be eliminated from the patient list. Furthermore, the test can be 

30 repeated for association correcting for any remaining relatedness among the patients, 
by extending a variance adjustment procedure described in Risch, N. & Teng, J. 
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(Genome Res., 8: 1278-1288 (1998)). The relative power of family-based and case- 
control designs for linkage disequilibrium studies of complex human diseases I. DNA 
pooling, (ibid) for sibships so that it can be applied to general familial relationships, 
and present both adjusted and unadjusted p-values for comparison. The differences 
5 are in general very small as expected. To assess the significance of single-marker 
association corrected for multiple testing we carried out a randomisation test using the 
same genotype data. Cohorts of patients and controls can be randomized and the 
association analysis redone multiple times {e.g., up to 500,000 times) and the p-value 
is the fraction of replications that produced a p-value for some marker allele that is 
10 lower than or equal to the p-value we observed using the original patient and control 
cohorts. 

For both single-marker and haplotype analyses, relative risk (RR) and the 
population attributable risk (PAR) can be calculated assuming a multiplicative model 
(haplotype relative risk model), (Terwilliger, J.D. & Ott, J., Hum Hered, 42, 337-46 

15 (1992) and Falk, C.T. & Rubinstein, P, Ann Hum Genet 51 ( Pt 3), 227-33 (1987)), 
i.e., that the risks of the two alleles/haplotypes a person carries multiply. For 
example, if RR is the risk of A relative to a, then the risk of a person homozygote AA 
will be RR times that of a heterozygote Aa and RR 2 times that of a homozygote aa. 
The multiplicative model has a nice property that simplifies analysis and 

20 computations — haplotypes are independent, i.e., in Hardy- Weinberg equilibrium, 
within the affected population as well as within the control population. As a 
consequence, haplotype counts of the affecteds and controls each have multinomial 
distributions, but with different haplotype frequencies under the alternative 
hypothesis. Specifically, for two haplotypes h t and h h risk^/risk^,) = (fi/pd/(fj/pj) 9 

25 where /and p denote respectively frequencies in the affected population and in the 
control population. While there is some power loss if the true model is not 
multiplicative, the loss tends to be mild except for extreme cases. Most importantly, 
p-values are always valid since they are computed with respect to null hypothesis. 
In general, haplotype frequencies are estimated by maximum likelihood and 

30 tests of differences between cases and controls are performed using a generalized 
likelihood ratio test (Rice, J. A. Mathematical Statistics and Data Analysis, 602 
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(International Thomson Publishing, (1995)). deCODE's haplotype analysis program 
called NEMO, which stands for NEsted MOdels, can be used to calculate all the 
haplotype results. To handle uncertainties with phase and missing genotypes, it is 
emphasized that we do not use a common two-step approach to association tests, 
5 where haplotype counts are first estimated, possibly with the use of the EM algorithm, 
Dempster, (A.P., Laird, N.M. & Rubin, D.B., Journal of the Royal Statistical Society 
B, 39, 1-38 (1971)) and then tests are performed treating the estimated counts as 
though they are true counts, a method that can sometimes be problematic and may 
require randomisation to properly evaluate statistical significance. Instead, with 

10 NEMO, maximum likelihood estimates, likelihood ratios and p-values are computed 
with the aid of the EM-algorithm directly for the observed data, and hence the loss of 
information due to uncertainty with phase and missing genotypes is automatically 
captured by the likelihood ratios. Even so, it is of interest to know how much 
information is retained, or lost, due to incomplete information. Described herein is 

15 such a measure that is natural under the likelihood framework. For a fixed set of 
markers, the simplest tests performed compare one selected haplotype against all the 
others. Call the selected haplotype h\ and the others h 2 , h k . Let/?], . . pk denote 
the population frequencies of the haplotypes in the controls, and fu ... 9 fk denote the 
population frequencies of the haplotypes in the affecteds. Under the null hypothesis, 

20 fi =pi for all i. The alternative model we use for the test assumes h 2 , hk to have the 
same risk while h\ is allowed to have a different risk. This implies that while p\ can be 
different from f u f}/(f 2 +...+f k ) =pi4p 2 +.. .+;?*) = ft for / = 2, k, Denoting/i//?, byr, 
and noting that . = 1, the test statistic based on generalized likelihood ratios is 

25 where £ denotes logelikelihood and ~ and A denote maximum likelihood estimates 
under the null hypothesis and alternative hypothesis respectively. A has 
asymptotically a chi-square distribution with 1-df, under the null hypothesis. Slightly 
more complicated null and alternative hypotheses can also be used. For example, let 
h\ be GO, h 2 be GX and /z 3 be AX. When comparing GO against GX, i.e., this is the 

30 test which gives estimated RR of 1 .46 and p-value = 0.0002, the null assumes GO and 
GX have the same risk but AX is allowed to have a different risk. The alternative 
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hypothesis allows, for example, three haplotype groups to have different risks. This 
implies that, under the null hypothesis, there is a constraint that fy/p } =f 2 /p 2 , or w = 
\f\/p\V\h/pi] = 1 - The test statistic based on generalized likelihood ratios is 
A = 2 [tipiJufaw) - tfaJufiA)] 
5 that again has asymptotically a chi-square distribution with 1 -df under the null 

hypothesis. If there are composite haplotypes (for example, h 2 and A 3 ), that is handled 
in a natural manner under the nested models framework. 

LD between pairs of SNPs can be calculated using the standard definition of 
D* and R 2 (Lewontin, R., Genetics 49, 49-67 (1964) and Hill, W.G. & Robertson, A. 

10 Theor. Appl. Genet. 22, 226-231 (1968)).Using NEMO, frequencies of the two 

marker allele combinations are estimated by maximum likelihood and deviation from 
linkage equilibrium is evaluated by a likelihood ratio test. The definitions of D' and 
R are extended to include microsatellites by averaging over the values for all possible 
allele combination of the two markers weighted by the marginal allele probabilities. 

15 When plotting all marker combination to elucidate the LD structure in a particular 
region, we plot D' in the upper left comer and the p-value in the lower right corner. 
In the LD plots the markers can be plotted equidistant rather than according to their 
physical location, if desired. 

20 Statistical Methods for Linkage A nalysis 

Multipoint, affected-only allele-sharing methods can be used in the analyses to 
assess evidence for linkage. Results, both the LOD-score and the non-parametric 
linkage (NPL) score, can be obtained using the program Allegro (Gudbjartsson et al., 
Nat. Genet. 25:12-3, 2000). Our baseline linkage analysis uses the Spairs scoring 

25 function (Whittemore, A.S., Halpern, J. (1994), Biometrics 50:1 18-27; Kruglyak L, et 
al. (1996), Am J Hum Genet 55:1347-63), the exponential allele-sharing model 
(Kong, A. and Cox, N.J. (1997), Am J Hum Genet 61: 1 179-88) and a family 
weighting scheme that is halfway, on the log-scale, between weighting each affected 
pair equally and weighting each family equally. The information measure we use is 

30 part of the Allegro program output and the information value equals zero if the marker 
genotypes are completely uninformative and equals one if the genotypes determine 
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the exact amount of allele sharing by decent among the affected relatives 
(Gretarsdottir et al, Am. J. Horn. Genet, 70:593-603, (2002)). We computed the P- 
values two different ways and here report the less significant result. The first P-value 
can be computed on the basis of large sample theory; the distribution of Zj r = 
5 V(2[loge(10)LOD]) approximates a standard normal variable under the null 

hypothesis of no linkage (Kong, A. and Cox, N.J. (1997), Am J Hum Genet (57:1 179- 
88). The second P-value can be calculated by comparing the observed LOD-score 
with its complete data sampling distribution under the null hypothesis {e.g., 
Gudbjartsson et al, Nat. Genet. 25:12-3, 2000). When the data consist of more than a 

10 few families, these two P- values tend to be very similar. 
NUCLEIC ACID THERAPEUTIC AGENTS 

In another embodiment, a nucleic acid of the invention; a nucleic acid 
complementary to a nucleic acid of the invention; or a portion of such a nucleic acid 
{e.g., an oligonucleotide as described below); or a nucleic acid encoding a KChlPl 

15 polypeptide, can be used in "antisense" therapy, in which a nucleic acid {e.g., an 

oligonucleotide) which specifically hybridizes to the mRNA and/or genomic DNA of 
a nucleic acid is administered or generated in situ. The antisense nucleic acid that 
specifically hybridizes to the mRNA and/or DNA inhibits expression of the 
polypeptide encoded by that mRNA and/or DNA, e.g., by inhibiting translation and/or 

20 transcription. Binding of the antisense nucleic acid can be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, through 
specific interaction in the major groove of the double helix. 

An antisense construct can be delivered, for example, as an expression 
plasmid as described above. When the plasmid is transcribed in the cell, it produces 

25 RNA that is complementary to a portion of the mRNA and/or DNA that encodes a 
KChlPl polypeptide. Alternatively, the antisense construct can be an oligonucleotide 
probe that is generated ex vivo and introduced into cells; it then inhibits expression by 
hybridizing with the mRNA and/or genomic DNA of the polypeptide. In one 
embodiment, the oligonucleotide probes are modified oligonucleotides that are 

30 resistant to endogenous nucleases, e.g., exonuc leases and/or endonucleases, thereby 
rendering them stable in vivo. Exemplary nucleic acid molecules for use as antisense 
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oligonucleotides are phosphoramidate, phosphothioate and methylphosphonate 
analogs of DNA (see also U.S. Patent Nos. 5,176,996, 5,264,564 and 5,256,775). 
Additionally, general approaches to constructing oligomers useful in antisense therapy 
are also described, for example, by Van der Krol et al. {BioTechniques 6:958-976 
5 (1988)); and Stein et al {Cancer Res. 48:2659-2668 (1988)). With respect to 

antisense DNA, oligodeoxyribonucleotides derived from the translation initiation site 
are preferred. 

To perform antisense therapy, oligonucleotides (mRNA, cDNA or DNA) are 
designed that are complementary to mRNA encoding the polypeptide. The antisense 

10 oligonucleotides bind to mRNA transcripts and prevent translation. Absolute 

complementarity, although preferred, is not required. A sequence "complementary" 
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient 
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the 
case of double-stranded antisense nucleic acids, a single strand of the duplex DNA 

15 may thus be tested, or triplex formation may be assayed. The ability to hybridize will 
depend on both the degree of complementarity and the length of the antisense nucleic 
acid, as described in detail above. Generally, the longer the hybridizing nucleic acid, 
the more base mismatches with an RNA it may contain and still form a stable duplex 
(or triplex, as the case may be). One skilled in the art can ascertain a tolerable degree 

20 of mismatch by use of standard procedures. 

The oligonucleotides used in antisense therapy can be DNA, RNA, or 
chimeric mixtures or derivatives or modified versions thereof, single-stranded or 
double-stranded. The oligonucleotides can be modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule, 

25 hybridization, etc. The oligonucleotides can include other appended groups such as 
peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al, Proc. Natl Acad. Sci. USA 
86:6553-6556 (1989); Lemaitre et al, Proc. Natl Acad. Sci. USA 84:648-652 (1987); 
PCT International Publication NO: WO 88/09810) or the blood-brain barrier (see, 

30 e.g. , PCT International Publication NO: WO 89/101 34), or hybridization-triggered 
cleavage agents (see, e.g., Krol et al, BioTechniques 6:958-976 (1988)) or 
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intercalating agents. (See, e.g., Zon, Pharm.Res. 5: 539-549 (1988)). To this end, the 
oligonucleotide may be conjugated to another molecule (e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent). 
The antisense molecules are delivered to cells that express a KChlPl 
5 polypeptide in vivo. A number of methods can be used for delivering antisense DNA 
or RNA to cells; e.g., antisense molecules can be injected directly into the tissue site, 
or modified antisense molecules, designed to target the desired cells (e.g., antisense 
linked to peptides or antibodies that specifically bind receptors or antigens expressed 
on the target cell surface) can be administered systematically. Alternatively, in a 

10 another embodiment, a recombinant DNA construct is utilized in which the antisense 
oligonucleotide is placed under the control of a strong promoter (e.g., pol III or pol 
II). The use of such a construct to transfect target cells in the patient results in the 
transcription of sufficient amounts of single stranded RNAs that will form 
complementary base pairs with the endogenous transcripts and thereby prevent 

15 translation of the mRNA. For example, a vector can be introduced in vivo such that it 
is taken up by a cell and directs the transcription of an antisense RNA. Such a vector 
can remain episomal or become chromosomally integrated, as long as it can be 
transcribed to produce the desired antisense RNA. Such vectors can be constructed 
by recombinant DNA technology methods standard in the art and described above. 

20 For example, a plasmid, cosmid, YAC or viral vector can be used to prepare the 
recombinant DNA construct that can be introduced directly into the tissue site. 
Alternatively, viral vectors can be used which selectively infect the desired tissue, in 
which case administration may be accomplished by another route (e.g., systemically). 
In another embodiment of the invention, small double-stranded interfering RNA 

25 (RNA interference (RNAi)) can be used. RNAi is a post-transcription process, in 
which double-stranded RNA is introduced, and sequence-specific gene silencing 
results, though catalytic degradation of the targeted mRNA. See, e.g., Elbashir, S.M. 
et al., Nature 4/7:494-498 (2001); Lee, N.S., Nature Biotech. 79:500-505 (2002); 
Lee, S-K. et al., Nature Medicine 5^:681-686 (2002); the entire teachings of these 

30 references are incorporated herein by reference. 
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RNAi is used routinely to investigate gene function in a high throughput 
fashion or to modulate gene expression in human diseases (Chi et ai, PNAS,100 
(ll):6343-6346 (2003)). 

Introduction of long double standed RNA leads to sequence-specific 
5 degradation of homologous gene transcripts. The long double stranded RNA is 
metabolized to small 21-23 nucleotide siRNA (small interfering RNA). The siRNA 
then binds to protein complex RISC (RNA-induced silencing complex) with dual 
function helicase. The helicase has RNAas activity and is able to unwind the RNA. 
The unwound si RNA allows an antisense strand to bind to a target. This results in 
10 sequence dependent degradation of cognate mRNA. Aside from endogenous RNAi, 
exogenous RNAi, chemically synthesized or recombinantly produced can also be 
used. 

Using non-intronic portions of the KChlPl gene such as corresponding mRNA 
portions of SEQ ID NO: 1 , target regions of the KChlPl gene that are accessible for 

15 RNAi are targeted and silenced. With this technique it is possible to conduct a RNAi 
gene walk of the nucleic acids of KChlPl and determine the amount of inhibition of 
the protein product. Thus, it is possible to design gene-specific therapeutics by 
directly targeting the mRNAs of Type II diabetes-related KChlPl gene. 

Endogenous expression of a gene product can also be reduced by inactivating 

20 or "knocking out" the gene or its promoter using targeted homologous recombination 
(e.g., see Smithies et al. 9 Nature 317:230-234 (1985); Thomas & Capecchi, Cell 
51:503-512 (1987); Thompson et ai, Cell 5:313-321 (1989)). For example, an 
altered, non-functional gene (or a completely unrelated DNA sequence) flanked by 
DNA homologous to the endogenous gene (either the coding regions or regulatory 

25 regions of the gene) can be used, with or without a selectable marker and/or a 

negative selectable marker, to transfect cells that express the gene in vivo. Insertion 
of the DNA construct, via targeted homologous recombination, results in inactivation 
of the gene. The recombinant DNA constructs can be directly administered or 
targeted to the required site in vivo using appropriate vectors, as described above. 

30 Alternatively, expression of non-altered genes can be increased using a similar 

method: targeted homologous recombination can be used to insert a DNA construct 
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comprising a non-altered functional gene, or the complement thereof, or a portion 
thereof, in place of an gene in the cell, as described above. In another embodiment, 
targeted homologous recombination can be used to insert a DNA construct 
comprising a nucleic acid that encodes a polypeptide variant that differs from that 
5 present in the cell. 

Alternatively, endogenous expression of a gene product can be reduced by 
targeting deoxyribonucleotide sequences complementary to the regulatory region (i.e., 
the promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the gene in target cells in the body. (See generally, Helene, C, 

10 Anticancer Drug Des., 6(6):569-84 (1991); Helene, C. etaL,Ann. MY. Acad. Sci. 
660:27-36 (1992); and Maher, L. J., Bioassays 14(12):807-15 (1992)). Likewise, the 
antisense constructs described herein, by antagonizing the normal biological activity 
of the gene product, can be used in the manipulation of tissue, e.g., tissue 
differentiation, both in vivo and for ex vivo tissue cultures. Furthermore, the anti- 

15 sense techniques (e.g., microinjection of antisense molecules, or transfection with 
plasmids whose transcripts are anti-sense with regard to a nucleic acid RNA or 
nucleic acid sequence) can be used to investigate the role of one or more members of 
the KChlPl pathway in the development of disease-related conditions. Such 
techniques can be utilized in cell culture, but can also be used in the creation of 

20 transgenic animals. 

The therapeutic agents as described herein can be delivered in a composition, 
as described above, or alone. They can be administered systemically, or can be 
targeted to a particular tissue. The therapeutic agents can be produced by a variety of 
means, including chemical synthesis; recombinant production; in vivo production 

25 (e.g. , a transgenic animal, such as U.S. Patent NO: 4,873,3 1 6 to Meade et al), for 
example, and can be isolated using standard means such as those described herein. In 
addition, a combination of any of the above methods of treatment (e.g., administration 
of non-altered polypeptide in conjunction with antisense therapy targeting altered 
mRNA; administration of a first splicing variant in conjunction with antisense therapy 

30 targeting a second splicing variant) can also be used. 
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The invention additionally pertains to use of such therapeutic agents, as 
described herein, for the manufacture of a medicament for the treatment of Type II 
diabetes e.g., using the methods described herein. 

5 MONITORING PROGRESS OF TREATMENT 

The current invention also pertains to methods of monitoring the effectiveness 
of treatment on the regulation of expression (e.g., relative or absolute expression) of 
one or more KChlPl isoforms at the RNA or protein level or its enzymatic activity. 
KChlPl message or protein or enzymatic activity can be measured in a sample of 

10 peripheral blood or cells derived therefrom. An assessment of the levels of expression 
or activity can be made before and during treatment with KChlPl therapeutic agents. 
For example, in one embodiment of the invention, an individual who is a member of 
the target population can be assessed for response to treatment with a KChlPl 
inhibitor, by examining calcium levels or Kv channel-interacting proteins activity or 

15 absolute and/or relative levels of KChlPl protein or mRNA isoforms in peripheral 
blood in general or specific cell subtractions or combination of cell subfractions. In 
addition, variation such as haplotypes or mutations within or near (within 100 to 
200kb) of the KChlPl gene may be used to identify individuals who are at higher risk 
for Type II diabetes to increase the power and efficiency of clinical trials for 

20 pharmaceutical agents to prevent or treat Type II diabetes. The haplotypes and other 
variations may be used to exclude or fractionate patients in a clinical trial who are 
likely to have non- KChlPl involvement in their Type II diabetes risk in order to 
enrich patients who have other genes or pathways involved and boost the power and 
sensitivity of the clinical trial. Such variation may be used as a pharmacogenomic test 

25 to guide selection of pharmaceutical agents for individuals. 

Described herein is the first known linkage study of Type II diabetes showing 
a connection to chromosome 5q35. Based on the linkage studies conducted, a direct 
relationship between Type II diabetes and the locus on chromosome 5q35, in 
particular the KChlPl gene, has been discovered. 



30 
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NUCLEIC ACIDS OF THE INVENTION 
KChlPl Nucleic Acids, Portions and Variants 

Accordingly, the invention pertains to isolated nucleic acid molecules 
comprising human KChlPl nucleic acid. The term, "KChlPl nucleic acid," as used 
5 herein, refers to an isolated nucleic acid molecule encoding a KChlPl polypeptide 
{e.g., a KChlPl gene, such as shown in SEQ ID NO:l). The KChlPl nucleic acid 
molecules of the present invention can be RNA, for example, mRNA, or DNA, such 
as cDNA and genomic DNA. DNA molecules can be double-stranded or single- 
stranded; single stranded RNA or DNA can be either the coding, or sense, strand or 

10 the non-coding, or antisense strand. The nucleic acid molecule can include all or a 
portion of the coding sequence of the gene and can further comprise additional non- 
coding sequences such as introns and non-coding 3 ? and 5 ! sequences (including 
regulatory sequences, for example). 

For example, the KChlPl nucleic acid can the genomic sequence shown in 

15 FIG. 1, or a portion or fragment of the isolated nucleic acid molecule {e.g., cDNA or 
the gene) that encodes KChlPl polypeptide. In certain embodiments, the isolated 
nucleic acid molecule comprises a nucleic acid molecule selected from the group 
consisting of SEQ ID NOs: 1 and 1 14-258 {e.g., in Table 10) or the complement of 
such a nucleic acid molecule. 

20 Additionally, nucleic acid molecules of the invention can be fused to a marker 

sequence, for example, a sequence that encodes a polypeptide to assist in isolation or 
purification of the polypeptide. Such sequences include, but are not limited to, those 
that encode a glutathione-S-transferase (GST) fusion protein and those that encode a 
hemagglutinin A (HA) polypeptide marker from influenza. 

25 An "isolated" nucleic acid molecule, as used herein, is one that is separated 

from nucleic acids that normally flank the gene or nucleotide sequence (as in genomic 
sequences) and/or has been completely or partially purified from other transcribed 
sequences {e.g., as in an RNA library). For example, an isolated nucleic acid of the 
invention may be substantially isolated with respect to the complex cellular milieu in 

30 which it naturally occurs, or culture medium when produced by recombinant 

techniques, or chemical precursors or other chemicals when chemically synthesized. 
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In some instances, the isolated material will form part of a composition (for example, 
a crude extract containing other substances), buffer system or reagent mix. In other 
circumstances, the material may be purified to essential homogeneity, for example as 
determined by PAGE or column chromatography such as HPLC. Preferably, an 
5 isolated nucleic acid molecule comprises at least about 50, 80 or 90% (on a molar 
basis) of all macromolecular species present. With regard to genomic DNA, the term 
"isolated" also can refer to nucleic acid molecules that are separated from the 
chromosome with which the genomic DNA is naturally associated. For example, the 
isolated nucleic acid molecule can contain less than about 5 kb but not limited to 4 kb, 

10 3 kb, 2 kb, 1 kb, 0.5 kb or 0. 1 kb of nucleotides which flank the nucleic acid molecule 
in the genomic DNA of the cell from which the nucleic acid molecule is derived. 

The nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. Thus, recombinant DNA contained in a 
vector is included in the definition of "isolated" as used herein. Also, isolated nucleic 

15 acid molecules include recombinant DNA molecules in heterologous host cells, as 
well as partially or substantially purified DNA molecules in solution. "Isolated" 
nucleic acid molecules also encompass in vivo and in vitro RNA transcripts of the 
DNA molecules of the present invention. An isolated nucleic acid molecule can 
include a nucleic acid molecule or nucleic acid sequence that is synthesized 

20 chemically or by recombinant means. Therefore, recombinant DNA contained in a 
vector is included in the definition of "isolated" as used herein. Also, isolated nucleic 
acid molecules include recombinant DNA molecules in heterologous organisms, as 
well as partially or substantially purified DNA molecules in solution. In vivo and in 
vitro RNA transcripts of the DNA molecules of the present invention are also 

25 encompassed by "isolated" nucleic acid sequences. Such isolated nucleic acid 
molecules are useful in the manufacture of the encoded polypeptide, as probes for 
isolating homologous sequences (e.g., from other mammalian species), for gene 
mapping {e.g., by in situ hybridization with chromosomes), or for detecting 
expression of the gene in tissue {e.g., human tissue), such as by Northern blot 

30 analysis. 
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The present invention also pertains to nucleic acid molecules which are not 
necessarily found in nature but which encode a KChlPl polypeptide, or another 
splicing variant of a KChlPl polypeptide or polymorphic variant thereof. Thus, for 
example, the invention pertains to DNA molecules comprising a sequence that is 
different from the naturally occurring nucleotide sequence but which, due to the 
degeneracy of the genetic code, encode a KChlPl polypeptide of the present 
invention. The invention also encompasses nucleic acid molecules encoding portions 
(fragments), or encoding variant polypeptides such as analogues or derivatives of a 
KChlPl polypeptide. Such variants can be naturally occurring, such as in the case of 
allelic variation or single nucleotide polymorphisms, or non-naturally-occurring, such 
as those induced by various mutagens and mutagenic processes. Intended variations 
include, but are not limited to, addition, deletion and substitution of one or more 
nucleotides that can result in conservative or non-conservative amino acid changes, 
including additions and deletions. Preferably the nucleotide (and/or resultant amino 
acid) changes are silent or conserved; that is, they do not alter the characteristics or 
activity of a KChlPl polypeptide. In one embodiment, the nucleic acid sequences are 
fragments that comprise one or more polymorphic microsatellite markers. In another 
embodiment, the nucleotide sequences are fragments that comprise one or more single 
nucleotide polymorphisms in a KChlPl gene. 

Other alterations of the nucleic acid molecules of the invention can include, 
for example, labeling, methylation, internucleotide modifications such as uncharged 
linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates), 
charged linkages (e.g., phosphorothioates, phosphorodithioates), pendent moieties 
(e.g., polypeptides), intercalators (e.g., acridine, psoralen), chelators, alkylators, and 
modified linkages (e.g., alpha anomeric nucleic acids). Also included are synthetic 
molecules that mimic nucleic acid molecules in the ability to bind to a designated 
sequence via hydrogen bonding and other chemical interactions. Such molecules 
include, for example, those in which peptide linkages substitute for phosphate 
linkages in the backbone of the molecule. 

The invention also pertains to nucleic acid molecules that hybridize under high 
stringency hybridization conditions, such as for selective hybridization, to a 
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nucleotide sequence described herein (e.g., nucleic acid molecules which specifically 
hybridize to a nucleotide sequence encoding polypeptides described herein, and, 
optionally, have an activity of the polypeptide). In one embodiment, the invention 
includes variants described herein which hybridize under high stringency 
5 hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 
1 14-258. In another embodiment, the invention includes variants described herein 
that hybridize under high stringency hybridization conditions (e.g., for selective 
hybridization) to a nucleotide sequence encoding an amino acid sequence or a 

10 polymorphic variant thereof. In another embodiment, the variant that hybridizes 
under high stringency hybridizations has an activity of a KChlPl polypeptide. 

Such nucleic acid molecules can be detected and/or isolated by specific 
hybridization (e.g., under high stringency conditions). "Specific hybridization," as 
used herein, refers to the ability of a first nucleic acid to hybridize to a second nucleic 

15 acid in a manner such that the first nucleic acid does not hybridize to any nucleic acid 
other than to the second nucleic acid (e.g., when the first nucleic acid has a higher 
similarity to the second nucleic acid than to any other nucleic acid in a sample 
wherein the hybridization is to be performed). "Stringency conditions" for 
hybridization is a term of art which refers to the incubation and wash conditions, e.g., 

20 conditions of temperature and buffer concentration, which permit hybridization of a 
particular nucleic acid to a second nucleic acid; the first nucleic acid may be perfectly 
(i.e., 100%) complementary to the second, or the first and second may share some 
degree of complementarity which is less than perfect (e.g., 70%, 75%, 85%, 90%, 
95%). For example, certain high stringency conditions can be used which distinguish 

25 perfectly complementary nucleic acids from those of less complementarity. "High 
stringency conditions", "moderate stringency conditions" and "low stringency 
conditions" for nucleic acid hybridizations are explained on pages 2.10.1-2.10:16 and 
pages 6.3.1-6.3.6 in Current Protocols in Molecular Biology (Ausubel, F.M. et aL, 
"Current Protocols in Molecular Biology ", John Wiley & Sons, (2001)), the entire 

30 teachings of which are incorporated by reference herein). The exact conditions which 
determine the stringency of hybridization depend not only on ionic strength (e.g., 
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0.2X SSC, 0.1X SSC), temperature (e.g., room temperature, 42°C, 68°C) and the 
concentration of destabilizing agents such as formamide or denaturing agents such as 
SDS, but also on factors such as the length of the nucleic acid sequence, base 
composition, percent mismatch between hybridizing sequences and the frequency of 

5 occurrence of subsets of that sequence within other non-identical sequences. Thus, 
equivalent conditions can be determined by varying one or more of these parameters 
while maintaining a similar degree of identity or similarity between the two nucleic 
acid molecules. Typically, conditions are used such that sequences at least about 
60%, at least about 70%, at least about 80%, at least about 90%, or at least about 95% 

10 or more identical to each other remain hybridized to one another. By varying 

hybridization conditions from a level of stringency at which no hybridization occurs 
to a level at which hybridization is first observed, conditions which will allow a given 
sequence to hybridize (e.g., selectively) with the most similar sequences in the sample 
can be determined. 

15 Exemplary conditions are described in Krause, M.H. and S.A. Aaronson, 

Methods in Enzymology 200:546-556 (1991), and in, Ausubel, et al., "Current 
Protocols in Molecular Biology", John Wiley & Sons, (2001), which describes the 
determination of washing conditions for moderate or low stringency conditions. 
Washing is the step in which conditions are usually set so as to determine a minimum 

20 level of complementarity of the hybrids. Generally, starting from the lowest 

temperature at which only homologous hybridization occurs, each °C by which the 
final wash temperature is reduced (holding SSC concentration constant) allows an 
increase by 1% in the maximum extent of mismatching among the sequences that 
hybridize. Generally, doubling the concentration of SSC results in an increase in T m 

25 of - 1 7°C. Using these guidelines, the washing temperature can be determined 

empirically for high, moderate or low stringency, depending on the level of mismatch 
sought. 

For example, a low stringency wash can comprise washing in a solution 
containing 0.2X SSC/0.1% SDS for 10 minutes at room temperature; a moderate 
30 stringency wash can comprise washing in a pre-warmed solution (42°C) solution 
containing 0.2X SSC/0.1% SDS for 15 minutes at 42°C; and a high stringency wash 



-39- 



can comprise washing in pre-warmed (68°C) solution containing 0.1X SSC/0.1%SDS 
for 15 minutes at 68°C. Furthermore, washes can be performed repeatedly or 
sequentially to obtain a desired result as known in the art. Equivalent conditions can 
be determined by varying one or more of the parameters given as an example, as 
known in the art, while maintaining a similar degree of identity or similarity between 
the target nucleic acid molecule and the primer or probe used. 

The percent homology or identity of two nucleotide or amino acid sequences 
can be determined by aligning the sequences for optimal comparison purposes (e.g., 
gaps can be introduced in the sequence of a first sequence for optimal alignment). 
The nucleotides or amino acids at corresponding positions are then compared, and the 
percent identity between the two sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # of identical positions/total # of 
positions x 100). When a position in one sequence is occupied by the same nucleotide 
or amino acid residue as the corresponding position in the other sequence, then the 
molecules are homologous at that position. As used herein, nucleic acid or amino 
acid "homology" is equivalent to nucleic acid or amino acid "identity". In certain 
embodiments, the length of a sequence aligned for comparison purposes is at least 
30%, for example, at least 40%, in certain embodiments at least 60%, and in other 
embodiments at least 70%, 80%, 90% or 95% of the length of the reference sequence. 
The actual comparison of the two sequences can be accomplished by well-known 
methods, for example, using a mathematical algorithm. A preferred, non-limiting 
example of such a mathematical algorithm is described in Karlin et al, Proc. Natl. 
Acad. Sci. USA 90:5873-5877 (1993). Such an algorithm is incorporated into the 
NBLAST and XBLAST programs (version 2.0) as described in Altschul et al, 
Nucleic Acids Res. 25:389-3402 (1997). When utilizing BLAST and Gapped BLAST 
programs, the default parameters of the respective programs (e.g., NBLAST) can be 
used. In one embodiment, parameters for sequence comparison can be set at 
score=100, wordlength=12, or can be varied (e.g., W=5 or W=20). 

Another preferred, non-limiting example of a mathematical algorithm utilized 
for the comparison of sequences is the algorithm of Myers and Miller, CABIOS 4(1): 
11-17 (1988). Such an algorithm is incorporated into the ALIGN program (version 
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2.0) which is part of the GCG sequence alignment software package (Accelrys, 
Cambridge, UK). When utilizing the ALIGN program for comparing amino acid 
sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 can be used. Additional algorithms for sequence analysis are known in 

5 the art and include ADVANCE and ADAM as described in Torellis and Robotti, 
Comput. Appl BioscL 10:3-5 (1994); and FASTA described in Pearson and Lipman, 
Proc. Natl Acad. Set USA 85:2444-8 (1988). 

In another embodiment, the percent identity between two amino acid 
sequences can be accomplished using the GAP program in the GCG software package 

10 using either a BLOSUM63 matrix or a PAM250 matrix, and a gap weight of 12, 10, 8, 
6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent 
identity between two nucleic acid sequences can be accomplished using the GAP 
program in the GCG software package using a gap weight of 50 and a length weight 
of3. 

15 The present invention also provides isolated nucleic acid molecules that 

contain a fragment or portion that hybridizes under highly stringent conditions to a 
nucleotide sequence comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1, 1 14-258, or the complement of such a sequence, and 
also provides isolated nucleic acid molecules that contain a fragment or portion that 

20 hybridizes under highly stringent conditions to a nucleotide sequence encoding an 
amino acid sequence or polymorphic variant thereof. The nucleic acid fragments of 
the invention are at least about 15, preferably at least about 18, 20, 23 or 25 
nucleotides, and can be 30, 40, 50, 100, 200 or more nucleotides in length. Longer 
fragments, for example, 30 or more nucleotides in length, that encode antigenic 

25 polypeptides described herein are particularly useful, such as for the generation of 
antibodies as described below. 

Probes and Primers 

In a related aspect, the nucleic acid fragments of the invention are used as 
30 probes or primers in assays such as those described herein. "Probes" or "primers" are 
oligonucleotides that hybridize in a base-specific manner to a complementary strand 
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of nucleic acid molecules. Such probes and primers include polypeptide nucleic 
acids, as described in Nielsen et aL, Science 254:1497-1500 (1991). 

A probe or primer comprises a region of nucleotide sequence that hybridizes 
to at least about 15, for example about 20-25, and in certain embodiments about 40, 
50 or 75, consecutive nucleotides of a nucleic acid molecule comprising a contiguous 
nucleotide sequence selected from the group consisting of SEQ ID NOs: 1, 1 14-258 
or polymorphic variant thereof In other embodiments, a probe or primer comprises 
100 or fewer nucleotides, in certain embodiments from 6 to 50 nucleotides, for 
example from 12 to 30 nucleotides. In other embodiments, the probe or primer is at 
least 70% identical to the contiguous nucleotide sequence or to the complement of the 
contiguous nucleotide sequence, for example at least 80% identical, in certain 
embodiments at least 90% identical, and in other embodiments at least 95% identical, 
or even capable of selectively hybridizing to the contiguous nucleotide sequence or to 
the complement of the contiguous nucleotide sequence. Often, the probe or primer 
further comprises a label, e.g., radioisotope, fluorescent compound, enzyme, or 
enzyme co-factor. 

The nucleic acid molecules of the invention such as those described above can 
be identified and isolated using standard molecular biology techniques and the 
sequence information provided herein. For example, nucleic acid molecules can be 
amplified and isolated by the polymerase chain reaction using synthetic 
oligonucleotide primers designed based on one or more of the sequences selected 
from the group consisting of SEQ ID NOs: 1, 1 14-258 or the complement of such a 
sequence, or designed based on nucleotides based on sequences encoding one or more 
of the amino acid sequences provided herein. See generally PCR Technology: 
Principles and Applications for DNA Amplification (ed. H.A. Erlich, Freeman Press, 
NY, NY, 1992); PCR Protocols: A Guide to Methods and Applications (Eds. Innis et 
al. 9 Academic Press, San Diego, CA, 1990); Mattila et aL, Nucl. Acids Res. 19: 4967 
(1991); Eckert et aL, PCR Methods and Applications 1:17 (1991); PCR (eds. 
McPherson et aL, IRL Press, Oxford); and U.S. Patent 4,683,202. The nucleic acid 
molecules can be amplified using cDNA, mRNA or genomic DNA as a template, 
cloned into an appropriate vector and characterized by DNA sequence analysis. 
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Other suitable amplification methods include the ligase chain reaction (LCR) 
(see Wu and Wallace, Genomics 4:560 (1989), Landegren et aL 9 Science 241:1077 

(1988) , transcription amplification (Kwoh etal, Proc. Natl. Acad. Sci. USA 86:1 173 

(1989) ), and self-sustained sequence replication (Guatelli et al, Proc. Nat Acad. Sci. 
USA 87:1874 (1990)) and nucleic acid based sequence amplification (NASBA). The 
latter two amplification methods involve isothermal reactions based on isothermal 
transcription, which produce both single stranded RNA (ssRNA) and double stranded 
DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1, 
respectively. 

The amplified DNA can be labeled, for example, radiolabeled, and used as a 
probe for screening a cDNA library derived from human cells, mRNA in zap express, 
ZIPLOX or other suitable vector. Corresponding clones can be isolated, DNA can 
obtained following in vivo excision, and the cloned insert can be sequenced in either 
or both orientations by art recognized methods to identify the correct reading frame 
encoding a polypeptide of the appropriate molecular weight. For example, the direct 
analysis of the nucleotide sequence of nucleic acid molecules of the present invention 
can be accomplished using well-known methods that are commercially available. 
See, for example, Sambrook et aL, Molecular Cloning, A Laboratory Manual (2nd 
Ed., CSHP, New York 1 989); Zyskind et al., Recombinant DNA Laboratory Manual, 
(Acad. Press, 1988)). Additionally, fluorescence methods are also available for 
analyzing nucleic acids (Chen et al t Genome Res. 9, 492 (1999)) and polypeptides. 
Using these or similar methods, the polypeptide and the DNA encoding the 
polypeptide can be isolated, sequenced and further characterized. 

Antisense nucleic acid molecules of the invention can be designed using the 
nucleotide sequences of one or more of SEQ ID NOs: 1, 1 14-258 and/or the 
complement of one or more of SEQ ID NOs: 1 , 1 14-258 and/or a portion of one or 
more of SEQ ID NOs: 1, 1 14-258 or the complement of one or more of SEQ ID NOs: 
1, 1 14-258 and constructed using chemical synthesis and enzymatic ligation reactions 
using procedures known in the art. For example, an antisense nucleic acid molecule 
{e.g., an antisense oligonucleotide) can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the 
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biological stability of the molecules or to increase the physical stability of the duplex 
formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. Alternatively, the 
antisense nucleic acid molecule can be produced biologically using an expression 
5 vector into which a nucleic acid molecule has been subcloned in an antisense 

orientation (i.e., RNA transcribed from the inserted nucleic acid molecule will be of 
an antisense orientation to a target nucleic acid of interest). 

The nucleic acid sequences can also be used to compare with endogenous 
DNA sequences in patients to identify one or more of the disorders described above, 

10 and as probes, such as to hybridize and discover related DNA sequences or to subtract 
out known sequences from a sample. The nucleic acid sequences can further be used 
to derive primers for genetic fingerprinting, to raise anti-polypeptide antibodies using 
DNA immunization techniques, and as an antigen to raise anti-DNA antibodies or 
elicit immune responses. Portions or fragments of the nucleotide sequences identified 

15 herein (and the corresponding complete gene sequences) can be used in numerous 
ways as polynucleotide reagents. For example, these sequences can be used to: (i) 
map their respective genes on a chromosome; and, thus, locate gene regions 
associated with genetic disease; (ii) identify an individual from a minute biological 
sample (tissue typing); and (iii) aid in forensic identification of a biological sample. 

20 Additionally, the nucleotide sequences of the invention can be used to identify and 
express recombinant polypeptides for analysis, characterization or therapeutic use, or 
as markers for tissues in which the corresponding polypeptide is expressed, either 
constitutively, during tissue differentiation, or in diseased states. The nucleic acid 
sequences can additionally be used as reagents in the screening and/or diagnostic 

25 assays described herein, and can also be included as components of kits (e.g., reagent 
kits) for use in the screening and/or diagnostic assays described herein. 

Vectors and Host Cells 

Another aspect of the invention pertains to nucleic acid constructs containing a 
30 nucleic acid molecule selected from the group consisting of SEQ ID NOs: 1, 1 14-258 
and the complements thereof (or a portion thereof). The constructs comprise a vector 
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(e.g, an expression vector) into which a sequence of the invention has been inserted 
in a sense or antisense orientation. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has 
been linked. One type of vector is a "plasmid", which refers to a circular double 
stranded DNA loop into which additional DNA segments can be ligated. Another 
type of vector is a viral vector, wherein additional DNA segments can be ligated into 
the viral genome. Certain vectors are capable of autonomous replication in a host cell 
into which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 
into the host cell, and thereby are replicated along with the host genome. Expression 
vectors are capable of directing the expression of genes to which they are operably 
linked. In general, expression vectors of utility in recombinant DNA techniques are 
often in the form of plasmids. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective 
retroviruses, adenoviruses and adeno-associated viruses) that serve equivalent 
functions. 

In certain embodiments, recombinant expression vectors of the invention 
comprise a nucleic acid molecule of the invention in a form suitable for expression of 
the nucleic acid molecule in a host cell. This means that the recombinant expression 
vectors include one or more regulatory sequences, selected on the basis of the host 
cells to be used for expression, which is operably linked to the nucleic acid sequence 
to be expressed. Within a recombinant expression vector, "operably linked" or 
"operatively linked" is intended to mean that the nucleotide sequence of interest is 
linked to the regulatory sequence(s) in a manner which allows for expression of the 
nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host 
cell when the vector is introduced into the host cell). The term "regulatory sequence" 
is intended to include promoters, enhancers and other expression control elements 
(e.g., polyadenylation signals). Such regulatory sequences are described, for example, 
in Goeddel, "Gene Expression Technology", Methods in Enzymology 185, Academic 
Press, San Diego, CA (1990). Regulatory sequences include those which direct 
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constitutive expression of a nucleotide sequence in many types of host cell and those 
which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of 

5 the host cell to be transformed and the level of expression of polypeptide desired. The 
expression vectors of the invention can be introduced into host cells to thereby 
produce polypeptides, including fusion polypeptides, encoded by nucleic acid 
molecules as described herein. 

The recombinant expression vectors of the invention can be designed for 

10 expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g., 
bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), 
yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
supra. Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 

15 polymerase. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but also to the progeny or potential 

20 progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic 

25 acid molecule of the invention can be expressed in bacterial cells (e.g., is. coli), insect 
cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS 
cells). Other suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 

30 "transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing a foreign nucleic acid molecule (eg., DNA) into a host 
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cell, including calcium phosphate or calcium chloride co-precipitation, DEAE- 
dextran-mediated transfection, lipofection, or electroporation. Suitable methods for 
transforming or transfecting host cells can be found in Sambrook, et al., (supra), and 
other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 
may integrate the foreign DNA into their genome. In order to identify and select 
these integrants, a gene that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells along with the gene of interest. 
Preferred selectable markers include those that confer resistance to drugs, such as 
G418, hygromycin and methotrexate. Nucleic acid molecules encoding a selectable 
marker can be introduced into a host cell on the same vector as the nucleic acid 
molecule of the invention or can be introduced on a separate vector. Cells stably 
transfected with the introduced nucleic acid molecule can be identified by drug 
selection (e.g., cells that have incorporated the selectable marker gene will survive, 
while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) a polypeptide of the invention. 
Accordingly, the invention further provides methods for producing a polypeptide 
using the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of invention (into which a recombinant expression vector 
encoding a polypeptide of the invention has been introduced) in a suitable medium 
such that the polypeptide is produced. In another embodiment, the method further 
comprises isolating the polypeptide from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 
transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which a nucleic acid molecule of the 
invention has been introduced (e.g., an exogenous KChlPl gene, or an exogenous 
nucleic acid encoding a KChlPl polypeptide). Such host cells can then be used to 
create non-human transgenic animals in which exogenous nucleotide sequences have 
been introduced into the genome or homologous recombinant animals in which 
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endogenous nucleotide sequences have been altered. Such animals are useful for 
studying the function and/or activity of the nucleotide sequence and polypeptide 
encoded by the sequence and for identifying and/or evaluating modulators of their 
activity. As used herein, a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, in which one or more of 
the cells of the animal include a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, chickens and amphibians. A 
transgene is exogenous DNA which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the genome of the mature animal, 
thereby directing the expression of an encoded gene product in one or more cell types 
or tissues of the transgenic animal As used herein, an "homologous recombinant 
animal" is a non-human animal, preferably a mammal, more preferably a mouse, in 
which an endogenous gene has been altered by homologous recombination between 
the endogenous gene and an exogenous DNA molecule introduced into a cell of the 
animal, e.g., an embryonic cell of the animal, prior to development of the animal. 

Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Pat. NO: 4,873,191 and in Hogan, Manipulating the Mouse Embryo (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Methods for constructing 
homologous recombination vectors and homologous recombinant animals are 
described further in Bradley, Current Opinion in BioTechnology 2:823-829 (1991) 
and in PCT Publication Nos. WO 90/1 1354, WO 91/01 140, WO 92/0968, and WO 
93/04169. Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut et aL, Nature 385:810-813 
(1997) and PCT Publication Nos. WO 97/07668 and WO 97/07669. 

POLYPEPTIDES OF THE INVENTION 

The present invention also pertains to isolated polypeptides encoded by 
KChlPl nucleic acids ("KChlPl polypeptides," or "KChlPl proteins," such as the 
protein shown in SEQ ID NO: 2) and fragments and variants thereof, as well as 
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polypeptides encoded by nucleotide sequences described herein (e.g., other splicing 
variants). The term "polypeptide" refers to a polymer of amino acids, and not to a 
specific length; thus, peptides, oligopeptides and proteins are included within the 
definition of a polypeptide. As used herein, a polypeptide is said to be "isolated" or 
"purified" when it is substantially free of cellular material when it is isolated from 
recombinant and non-recombinant cells, or free of chemical precursors or other 
chemicals when it is chemically synthesized. A polypeptide, however, can be joined 
to another polypeptide with which it is not normally associated in a cell (e.g., in a 
"fusion protein") and still be "isolated" or "purified." 

The polypeptides of the invention can be purified to homogeneity. It is 
understood, however, that preparations in which the polypeptide is not purified to 
homogeneity are useful. The critical feature is that the preparation allows for the 
desired function of the polypeptide, even in the presence of considerable amounts of 
other components. Thus, the invention encompasses various degrees of purity. In one 
embodiment, the language "substantially free of cellular material" includes 
preparations of the polypeptide having less than about 30% (by dry weight) other 
proteins (i.e., contaminating protein), less than about 20% other proteins, less than 
about 1 0% other proteins, or less than about 5% other proteins. 

When a polypeptide is recombinantly produced, it can also be substantially 
free of culture medium, i.e., culture medium represents less than about 20%, less than 
about 10%, or less than about 5% of the volume of the polypeptide preparation. The 
language "substantially free of chemical precursors or other chemicals" includes 
preparations of the polypeptide in which it is separated from chemical precursors or 
other chemicals that are involved in its synthesis. In one embodiment, the language 
"substantially free of chemical precursors or other chemicals" includes preparations of 
the polypeptide having less than about 30% (by dry weight) chemical precursors or 
other chemicals, less than about 20% chemical precursors or other chemicals, less 
than about 10% chemical precursors or other chemicals, or less than about 5% 
chemical precursors or other chemicals. 

In one embodiment, a polypeptide of the invention comprises an amino acid 
sequence encoded by a nucleic acid molecule comprising a nucleotide sequence of 
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SEQ ID NO: 1 , optionally additionally comprising one or more of SEQ ID NOs: 1 14- 
258; or the complement of such a nucleic acid, or portions thereof, or a portion or 
polymorphic variant thereof However, the polypeptides of the invention also 
encompass fragment and sequence variants. Variants include a substantially 
homologous polypeptide encoded by the same genetic locus in an organism, i.e., an 
allelic variant, as well as other splicing variants. Variants also encompass 
polypeptides derived from other genetic loci in an organism, but having substantial 
homology to a polypeptide encoded by a nucleic acid molecule comprising a 
nucleotide of SEQ ID NO: 1, optionally additionally one or more of SEQ ID NOs: 
1 14-258; or a complement of such a sequence, or portions thereof or polymorphic 
variants thereof Variants also include polypeptides substantially homologous or 
identical to these polypeptides but derived from another organism, i.e., an ortholog. 
Variants also include polypeptides that are substantially homologous or identical to 
these polypeptides that are produced by chemical synthesis. Variants also include 
polypeptides that are substantially homologous or identical to these polypeptides that 
are produced by recombinant methods. 

As used herein, two polypeptides (or a region of the polypeptides) are 
substantially homologous or identical when the amino acid sequences are at least 
about 45-55%, in certain embodiments at least about 70-75%, and in other 
embodiments at least about 80-85%, and in other embodiments greater than about 
90% or more homologous or identical. A substantially homologous amino acid 
sequence, according to the present invention, will be encoded by a nucleic acid 
molecule hybridizing to of SEQ ID NO: 1 or any one of 1 14-258 or portion thereof, 
under stringent conditions as more particularly described above, or will be encoded by 
a nucleic acid molecule hybridizing to a nucleic acid sequence encoding SEQ ID NO: 
1 or any one of 1 14-258 or a portion thereof or polymorphic variant thereof, under 
stringent conditions as more particularly described thereof 

The invention also encompasses polypeptides having a lower degree of 
identity but having sufficient similarity so as to perform one or more of the same 
functions performed by a polypeptide encoded by a nucleic acid molecule of the 
invention. 
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Similarity is determined by conserved amino acid substitution where a given 
amino acid in a polypeptide is substituted by another amino acid of like 
characteristics. Conservative substitutions are likely to be phenotypically silent. 
Typically seen as conservative substitutions are the replacements, one for another, 
5 among the aliphatic amino acids Ala, Val, Leu and He; interchange of the hydroxyl 
residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution 
between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg 
and replacements among the aromatic residues Phe and Tyr. Guidance concerning 
which amino acid changes are likely to be phenotypically silent are found in Bowie et 

10 al, Science 247:1306-1310 (1990). 

A variant polypeptide can differ in amino acid sequence by one or more 
substitutions, deletions, insertions, inversions, fusions, and truncations or a 
combination of any of these. Further, variant polypeptides can be fully functional or 
can lack function in one or more activities. Fully functional variants typically contain 

15 only conservative variation or variation in non-critical residues or in non-critical 

regions. Functional variants can also contain substitution of similar amino acids that 
result in no change or an insignificant change in function. Alternatively, such 
substitutions may positively or negatively affect function to some degree. Non- 
functional variants typically contain one or more non-conservative amino acid 

20 substitutions, deletions, insertions, inversions, or truncation or a substitution, 
insertion, inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods 
known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
(Cunningham et ai, Science 244:1082-1 185 (1989)). The latter procedure introduces 

25 single alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for biological activity in vitro, or in vitro proliferative 
activity. Sites that are critical for polypeptide activity can also be determined by 
structural analysis such as crystallization, nuclear magnetic resonance or photoaffinity 
labeling (Smith et al. 9 J. Mol Biol 224:899-904 (1992); de Vos et al. 9 Science 

30 255:306-312(1992)). 
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The invention also includes polypeptide fragments of the polypeptides of the 
invention. Fragments can be derived from a polypeptide encoded by a nucleic acid 
molecule comprising SEQ ID NO: 1 and optionally comprising one or more of SEQ 
IDNOs: 114-258: or a complement of such a nucleic acid or other variants. 
5 However, the invention also encompasses fragments of the variants of the 
polypeptides described herein. As used herein, a fragment comprises at least 6 
contiguous amino acids. Useful fragments include those that retain one or more of the 
biological activities of the polypeptide as well as fragments that can be used as an 
immunogen to generate polypeptide-specific antibodies. 

10 Biologically active fragments (peptides which are, for example, 6, 9, 12, 1 5, 

16, 20, 30, 35, 36, 37, 38, 39, 40, 50, 100 or more amino acids in length) can comprise 
a domain, segment, or motif that has been identified by analysis of the polypeptide 
sequence using well-known methods, e.g., signal peptides, extracellular domains, one 
or more transmembrane segments or loops, ligand binding regions, zinc finger 

15 domains, DNA binding domains, acylation sites, glycosylation sites, or 
phosphorylation sites. 

Fragments can be discrete (not fused to other amino acids or polypeptides) or 
can be within a larger polypeptide. Further, several fragments can be comprised 
within a single larger polypeptide. In one embodiment a fragment designed for 

20 expression in a host can have heterologous pre- and pro-polypeptide regions fused to 
the amino terminus of the polypeptide fragment and an additional region fused to the 
carboxyl terminus of the fragment. 

The invention thus provides chimeric or fusion polypeptides. These comprise 
a polypeptide of the invention operatively linked to a heterologous protein or 

25 polypeptide having an amino acid sequence not substantially homologous to the 
polypeptide. 

"Operatively linked" indicates that the polypeptide and the heterologous 
protein are fused in-frame. The heterologous protein can be fused to the N-terminus 
or C-terminus of the polypeptide. In one embodiment the fusion polypeptide does not 
30 affect function of the polypeptide per se. For example, the fusion polypeptide can be 
a GST-fusion polypeptide in which the polypeptide sequences are fused to the C- 
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terminus of the GST sequences. Other types of fusion polypeptides include, but are 
not limited to, enzymatic fusion polypeptides, for example beta-galactosidase fusions, 
yeast two-hybrid GAL fusions, poly-His fusions and Ig fusions. Such fusion 
polypeptides, particularly poly-His fusions, can facilitate the purification of 
recombinant polypeptide. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of a polypeptide can be increased using a heterologous 
signal sequence. Therefore, in another embodiment, the fusion polypeptide contains a 
heterologous signal sequence at its N-terminus. 

EP-A-0 464 533 discloses fusion proteins comprising various portions of 
immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and thus 
results, for example, in improved pharmacokinetic properties (EP-A 0232 262). In 
drug discovery, for example, human proteins have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists. Bennett et aL 9 
Journal of Molecular Recognition, 5:52-58 (1995) and Johanson et al, The Journal of 
Biological Chemistry, 270,16:9459-94'/ '1 (1995). Thus, this invention also 
encompasses soluble fusion polypeptides containing a polypeptide of the invention 
and various portions of the constant regions of heavy or light chains of 
immunoglobulins of various subclasses (IgG, IgM, IgA, IgE). 

A chimeric or fusion polypeptide can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional techniques. 
In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification 
of nucleic acid fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive nucleic acid fragments which can 
subsequently be annealed and re-amplified to generate a chimeric nucleic acid 
sequence (see Ausubel et al , Current Protocols in Molecular Biology, 1 992). 

Moreover, many expression vectors are commercially available that already 
encode a fusion moiety {e.g., a GST protein). A nucleic acid molecule encoding a 
polypeptide of the invention can be cloned into such an expression vector such that 
the fusion moiety is linked in-frame to the polypeptide. 
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The isolated polypeptide can be purified from cells that naturally express it, 
can be purified from cells that have been altered to express it (recombinant), or 
synthesized using known protein synthesis methods. In one embodiment, the 
polypeptide is produced by recombinant DNA techniques. For example, a nucleic 

5 acid molecule encoding the polypeptide is cloned into an expression vector, the 

expression vector introduced into a host cell and the polypeptide expressed in the host 
cell. The polypeptide can then be isolated from the cells by an appropriate 
purification scheme using standard protein purification techniques. 

The polypeptides of the present invention can be used to raise antibodies or to 

10 elicit an immune response. The polypeptides can also be used as a reagent, e.g., a 
labeled reagent, in assays to quantitatively determine levels of the polypeptide or a 
molecule to which it binds (e.g., a ligand) in biological fluids. The polypeptides can 
also be used as markers for cells or tissues in which the corresponding polypeptide is 
preferentially expressed, either constitutively, during tissue differentiation, or in a 

15 diseased state. The polypeptides can be used to isolate a corresponding binding agent, 
e.g., ligand or receptor, such as, for example, in an interaction trap assay, and to 
screen for peptide or small molecule antagonists or agonists of the binding interaction. 

ANTIBODIES OF THE INVENTION 

20 Polyclonal antibodies and/or monoclonal antibodies that specifically bind one 

form of the gene product but not to the other form of the gene product are also 
provided. Antibodies are also provided which bind a portion of either the variant or 
the reference gene product that contains the polymorphic site or sites. The term 
"antibody" as used herein refers to immunoglobulin molecules and immunologically 

25 active portions of immunoglobulin molecules, i.e. , molecules that contain an antigen 
binding site that specifically bind an antigen. A molecule that specifically binds to a 
polypeptide of the invention is a molecule that binds to that polypeptide or a fragment 
thereof, but does not substantially bind other molecules in a sample, e.g., a biological 
sample, which naturally contains the polypeptide. Examples of immunologically 

30 active portions of immunoglobulin molecules include F(ab) and F(ab')2 fragments 
which can be generated by treating the antibody with an enzyme such as pepsin. The 
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invention provides polyclonal and monoclonal antibodies that bind to a polypeptide of 
the invention. The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of antibody molecules that 
contain only one species of an antigen binding site capable of immunoreacting with a 
5 particular epitope of a polypeptide of the invention. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular 
polypeptide of the invention with which it immunoreacts. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a desired immunogen, e.g., polypeptide of the invention or a 

10 fragment thereof. The antibody titer in the immunized subject can be monitored over 
time by standard techniques, such as with an enzyme linked immunosorbent assay 
(ELISA) using immobilized polypeptide. If desired, the antibody molecules directed 
against the polypeptide can be isolated from the mammal (e.g., from the blood) and 
further purified by well-known techniques, such as protein A chromatography to 

15 obtain the IgG fraction. At an appropriate time after immunization, e.g., when the 
antibody titers are highest, antibody-producing cells can be obtained from the subject 
and used to prepare monoclonal antibodies by standard techniques, such as the 
hybridoma technique originally described by Kohler and Milstein, Nature 256:495- 
497 (1975), the human B cell hybridoma technique (Kozbor et al, Immunol Today 4: 

20 72 (1 983)), the EBV-hybridoma technique (Cole et al. , Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss,1985, Inc., pp. 77-96) or trioma techniques. The 
technology for producing hybridomas is well known (see generally Current Protocols 
in Immunology (1994) Coligan et al, (eds.) John Wiley & Sons, Inc., New York, 
NY). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes 

25 (typically splenocytes) from a mammal immunized with an immunogen as described 
above, and the culture supernatants of the resulting hybridoma cells are screened to 
identify a hybridoma producing a monoclonal antibody that binds a polypeptide of the 
invention. 

Any of the many well known protocols used for fusing lymphocytes and 
30 immortalized cell lines can be applied for the purpose of generating a monoclonal 
antibody to a polypeptide of the invention (see, e.g., Current Protocols in 
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Immunology, supra; Galfre et al, Nature 266:55052 (1977); R.H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); and Lerner, Yale J. Biol Med. 54:387-402 
(1981)). Moreover, the ordinarily skilled worker will appreciate that there are many 
5 variations of such methods that also would be useful. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal antibody to a polypeptide of the invention can be identified and isolated 
by screening a recombinant combinatorial immunoglobulin library {e.g., an antibody 
phage display library) with the polypeptide to thereby isolate immunoglobulin library 

10 members that bind the polypeptide. Kits for generating and screening phage display 
libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog NO: 27-9400-01 ; and the Stratagene SurfZAP™ Phage 
Display Kit, Catalog NO: 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening antibody display library can 

15 be found in, for example, U.S. Patent NO: 5,223,409; PCT Publication NO: WO 

92/18619; PCT Publication NO: WO 91/17271; PCT Publication NO: WO 92/20791; 
PCT Publication NO: WO 92/15679; PCT Publication NO: WO 93/01288; PCT 
Publication NO: WO 92/01047; PCT Publication NO: WO 92/09690; PCT 
Publication NO: WO 90/02809; Fuchs et al, Bio/Technology 9: 1370-1372 (1991); 

20 Hay et al, Hum. Antibod. Hybridomas 3:81-85 (1992); Huse et al, Science 246: 
1275-1281 (1989); and Griffiths et al, EMBO J. 12:725-734(1993). 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can 
be made using standard recombinant DNA techniques, are within the scope of the 

25 invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art. 

In general, antibodies of the invention (e.g., a monoclonal antibody) can be 
used to isolate a polypeptide of the invention by standard techniques, such as affinity 
chromatography or immunoprecipitation. A polypeptide-specific antibody can 

30 facilitate the purification of natural polypeptide from cells and of recombinantly 
produced polypeptide expressed in host cells. Moreover, an antibody specific for a 
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polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular 
lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and 
pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for 

5 example, determine the efficacy of a given treatment regimen. The antibody can be 
coupled to a detectable substance to facilitate its detection. Examples of detectable 
substances include various enzymes, prosthetic groups, fluorescent materials, 
luminescent materials, bioluminescent materials, and radioactive materials. Examples 
of suitable enzymes include horseradish peroxidase, alkaline phosphatase, beta- 

10 galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent 

15 materials include luciferase, luciferin, and aequorin, and examples of suitable 
radioactive material include 125 1, 13 1 1, 35 S or 3 H. 

DIAGNOSTIC ASSAYS 

The nucleic acids, probes, primers, polypeptides and antibodies described 

20 herein can be used in methods of diagnosis of Type II diabetes; of a susceptibility to 
Type II diabetes; or of a condition associated with a KChlPl gene, as well as in kits 
(e.g., useful for diagnosis of Type II diabetes; a susceptibility to Type II diabetes; or a 
condition associated with a KChlPl gene). In one embodiment, the kit comprises 
primers which contain one or more of the SNP's identified in Table 10. 

25 In one embodiment of the invention, diagnosis of a disease or condition 

associated with a KChlPl gene (e.g., diagnosis of Type II diabetes, or of a 
susceptibility to Type II diabetes) is made by detecting a polymorphism in a KChlPl 
nucleic acid as described herein. The polymorphism can be a change in a KChlPl 
nucleic acid, such as the insertion or deletion of a single nucleotide, or of more than 

30 one nucleotide, resulting in a frame shift; the change of at least one nucleotide, 

resulting in a change in the encoded amino acid; the change of at least one nucleotide, 
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resulting in the generation of a premature stop codon; the deletion of several 
nucleotides, resulting in a deletion of one or more amino acids encoded by the 
nucleotides; the insertion of one or several nucleotides, such as by unequal 
recombination or gene conversion, resulting in an interruption of the coding sequence 
of the gene; duplication of all or a part of the gene; transposition of all or a part of the 
gene; or rearrangement of all or a part of the gene. More than one such change may 
be present in a single gene. Such sequence changes cause a difference in the 
polypeptide encoded by a KChlPl nucleic acid. For example, if the difference is a 
frame shift change, the frame shift can result in a change in the encoded amino acids, 
and/or can result in the generation of a premature stop codon, causing generation of a 
truncated polypeptide. Alternatively, a polymorphism associated with a disease or 
condition or a susceptibility to a disease or condition associated with a KChlPl 
nucleic acid can be a synonymous alteration in one or more nucleotides (i.e., an 
alteration that does not result in a change in the polypeptide encoded by a KChlPl 
nucleic acid). Such a polymorphism may alter splicing sites, affect the stability or 
transport of mRNA, or otherwise affect the transcription or translation of the gene. A 
KChlPl nucleic acid that has any of the changes or alterations described above is 
referred to herein as an "altered nucleic acid." 

In a first method of diagnosing Type II diabetes or a susceptibility to Type II 
diabetes, or another disease or condition associated with a KChlPl gene, 
hybridization methods, such as Southern analysis, Northern analysis, or in situ 
hybridizations, can be used (see Current Protocols in Molecular Biology, Ausubel, F. 
et al, eds, John Wiley & Sons, including all supplements through 1999). For 
example, a biological sample (a "test sample") from a test subject (the "test 
individual") of genomic DNA, RNA, or cDNA, is obtained from an individual, such 
as an individual suspected of having, being susceptible to or predisposed for, or 
carrying a defect for, the disease or condition, or the susceptibility to the disease or 
condition, associated with a KChlPl gene {e.g., Type II diabetes). The individual can 
be an adult, child, or fetus. The test sample can be from any source which contains 
genomic DNA, such as a blood sample, sample of amniotic fluid, sample of 
cerebrospinal fluid, or tissue sample from skin, muscle, buccal or conjunctival 
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mucosa, placenta, gastrointestinal tract or other organs. A test sample of DNA from 
fetal cells or tissue can be obtained by appropriate methods, such as by amniocentesis 
or chorionic villus sampling. The DNA, RNA, or cDNA sample is then examined to 
determine whether a polymorphism in a KChlPl nucleic acid is present, and/or to 
5 determine which splicing variant(s) encoded by the KChlPl is present. The presence 
of the polymorphism or splicing variant(s) can be indicated by hybridization of the 
gene in the genomic DNA, RNA, or cDNA to a nucleic acid probe. A "nucleic acid 
probe", as used herein, can be a DNA probe or an RNA probe; the nucleic acid probe 
can contain, for example, at least one polymorphism in a KChlPl nucleic acid {e.g., 

10 as set forth in Table 10) and/or contain a nucleic acid encoding a particular splicing 
variant of a KChlPl nucleic acid. The probe can be any of the nucleic acid molecules 
described above {e.g., the gene or nucleic acid, a fragment, a vector comprising the 
gene or nucleic acid, a probe or primer, etc.). 

To diagnose Type II diabetes, or a susceptibility to Type II diabetes, or 

15 another condition associated with a KChlPl gene, a hybridization sample is formed 
by contacting the test sample containing a KChlPl nucleic acid with at least one 
nucleic acid probe. A preferred probe for detecting mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to mRNA or genomic DNA 
sequences described herein. The nucleic acid probe can be, for example, a full-length 

20 nucleic acid molecule, or a portion thereof, such as an oligonucleotide of at least 15, 
30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize 
under stringent conditions to appropriate mRNA or genomic DNA. For example, the 
nucleic acid probe can be all or a portion of one of SEQ ID NOs: 1 14-258 or the 
complement thereof, or a portion thereof. Other suitable probes for use in the 

25 diagnostic assays of the invention are described above (see e.g., probes and primers 
discussed under the heading, "Nucleic Acids of the Invention"). 

The hybridization sample is maintained under conditions that are sufficient to 
allow specific hybridization of the nucleic acid probe to a KChlPl nucleic acid. 
"Specific hybridization", as used herein, indicates exact hybridization {e.g., with no 

30 mismatches). Specific hybridization can be performed under high stringency 

conditions or moderate stringency conditions, for example, as described above. In a 
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particularly preferred embodiment, the hybridization conditions for specific 
hybridization are high stringency. 

Specific hybridization, if present, is then detected using standard methods. If 
specific hybridization occurs between the nucleic acid probe and KChlPl nucleic acid 
5 in the test sample, then the KChlPl has the polymorphism, or is the splicing variant, 
that is present in the nucleic acid probe. More than one nucleic acid probe can also be 
used concurrently in this method. Specific hybridization of any one of the nucleic 
acid probes is indicative of a polymorphism in the KChlPl nucleic acid, or of the 
presence of a particular splicing variant encoding the KChlPl nucleic acid and is 

10 therefore diagnostic for a susceptibility to a disease or condition associated with a 
KChlPl nucleic acid (e.g., Type II diabetes). 

In Northern analysis (see Current Protocols in Molecular Biology, Ausubel, F. 
et aL 9 eds., John Wiley & Sons, supra) the hybridization methods described above are 
used to identify the presence of a polymorphism or a particular splicing variant, 

15 associated with a susceptibility to a disease or condition associated with a KChlPl 
gene (e.g., Type II diabetes). For Northern analysis, a test sample of RNA is obtained 
from the individual by appropriate means. Specific hybridization of a nucleic acid 
probe, as described above, to RNA from the individual is indicative of a 
polymorphism in a KChlPl nucleic acid, or of the presence of a particular splicing 

20 variant encoded by a KChlPl nucleic acid and is therefore diagnostic for Type II 
diabetes or a susceptibility to Type II diabetes or a condition associated with a 
KChlPl nucleic acid (e.g., Type II diabetes). 

For representative examples of use of nucleic acid probes, see, for example, 
U.S. Patents NO: 5,288,61 1 and 4,851,330. 

25 Alternatively, a peptide nucleic acid (PNA) probe can be used instead of a 

nucleic acid probe in the hybridization methods described above. PNA is a DNA 
mimic having a peptide-like, inorganic backbone, such as N-(2-aminoethyl)glycine 
units, with an organic base (A, G, C, T or U) attached to the glycine nitrogen via a 
methylene carbonyl linker (see, for example, Nielsen, P.E. et al., Bioconjugate 

30 Chemistry 5, American Chemical Society, p. 1 (1994). The PNA probe can be 

designed to specifically hybridize to a gene having a polymorphism associated with a 
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susceptibility to a disease or condition associated with a KChlPl nucleic acid (e.g., 
Type II diabetes). Hybridization of the PNA probe to a KChlPl gene is diagnostic for 
Type II diabetes or a susceptibility to Type II diabetes or a condition associated with a 
KChlPl nucleic acid. 

5 In another method of the invention, alteration analysis by restriction digestion 

can be used to detect an altered gene, or genes containing a polymorphism(s), if the 
alteration (mutation) or polymorphism in the gene results in the creation or 
elimination of a restriction site. A test sample containing genomic DNA is obtained 
from the individual. Polymerase chain reaction (PCR) can be used to amplify a 

10 KChlPl nucleic acid (and, if necessary, the flanking sequences) in the test sample of 
genomic DNA from the test individual. RFLP analysis is conducted as described (see 
Current Protocols in Molecular Biology, supra). The digestion pattern of the relevant 
DNA fragment indicates the presence or absence of the alteration or polymorphism in 
the KChlPl nucleic acid, and therefore indicates the presence or absence of Type II 

15 diabetes or the susceptibility to a disease or condition associated with a KChlPl 
nucleic acid. 

Sequence analysis can also be used to detect specific polymorphisms in a 
KChlPl nucleic acid. A test sample of DNA or RNA is obtained from the test 
individual. PCR or other appropriate methods can be used to amplify the gene or 

20 nucleic acid, and/or its flanking sequences, if desired. The sequence of a KChlPl 

nucleic acid, or a fragment of the nucleic acid, or cDNA, or fragment of the cDNA, or 
mRNA, or fragment of the mRNA, is determined, using standard methods. The 
sequence of the nucleic acid, nucleic acid fragment, cDNA, cDNA fragment, mRNA, 
or mRNA fragment is compared with the known nucleic acid sequence of the gene or 

25 cDNA (e.g. , one or more of SEQ ID NOs:, 1 14-258 or a complement thereof ) or 

mRNA, as appropriate. The presence of a polymorphism in the KChlPl indicates that 
the individual has Type II diabetes or a susceptibility to Type II diabetes. 

Allele-specific oligonucleotides, can also be used to detect the presence of a 
polymorphism in a KChlPl nucleic acid, through the use of dot-blot hybridization of 

30 amplified oligonucleotides with allele-specific oligonucleotide (ASO) probes (see, for 
example, Saiki, R. et al., Nature 324:163-166 (1986)). An "allele-specific 
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oligonucleotide" (also referred to herein as an "allele-specific oligonucleotide probe") 
is an oligonucleotide of approximately 10-50 base pairs, preferably approximately 15- 
30 base pairs, that specifically hybridizes to a KChlPl nucleic acid, and that contains 
a polymorphism associated with a susceptibility to a disease or condition associated 

5 with a KChlPl nucleic acid. An allele-specific oligonucleotide probe that is specific 
for particular polymorphisms in a KChlPl nucleic acid can be prepared, using 
standard methods (see Current Protocols in Molecular Biology, supra). To identify 
polymorphisms in the gene that are associated with a disease or condition associated 
with a KChlPl nucleic acid or a susceptibility to a disease or condition associated 

10 with a KChlPl nucleic acid a test sample of DNA is obtained from the individual. 
. PCR can be used to amplify all or a fragment of a KChlPl nucleic acid and its 
flanking sequences. The DNA containing the amplified KChlPl nucleic acid (or 
fragment of the gene or nucleic acid) is dot-blotted, using standard methods (see 
Current Protocols in Molecular Biology, supra), and the blot is contacted with the 

15 oligonucleotide probe. The presence of specific hybridization of the probe to the 
amplified KChlPl nucleic acid is then detected. Hybridization of an allele-specific 
oligonucleotide probe to DNA from the individual is indicative of a polymorphism in 
the KChlPl nucleic acid, and is therefore indicative of a disease or condition 
associated with a KChlPl nucleic acid or susceptibility to a disease or condition 

20 associated with a KChlPl nucleic acid {e.g., Type II diabetes). 

The invention further provides allele-specific oligonucleotides that hybridize 
to the reference or variant allele of a gene or nucleic acid comprising a single 
nucleotide polymorphism or to the complement thereof. These oligonucleotides can 
be probes or primers. 

25 An allele-specific primer hybridizes to a site on target DNA overlapping a 

polymorphism and only primes amplification of an allelic form to which the primer 
exhibits perfect complementarity. See Gibbs, Nucleic Acid Res. 17, 2427-2448 
(1989). This primer is used in conjunction with a second primer, which hybridizes at 
a distal site. Amplification proceeds from the two primers, resulting in a detectable 

30 product, which indicates the particular allelic form is present. A control is usually 
performed with a second pair of primers, one of which shows a single base mismatch 
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at the polymorphic site and the other of which exhibits perfect complementarity to a 
distal site. The single-base mismatch prevents amplification and no detectable 
product is formed. The method works best when the mismatch is included in the 3'- 
most position of the oligonucleotide aligned with the polymorphism because this 
5 position is most destabilizing to elongation from the primer (see, e.g., WO 93/22456). 

With the addition of such analogs as locked nucleic acids (LNAs), the size of 
primers and probes can be reduced to as few as 8 bases. LNAs are a novel class of 
bicyclic DNA analogs in which the T and 4' positions in the furanose ring are joined 
via an O-methylene (oxy-LNA), S-methylene (thio-LNA), or amino methylene 

10 (amino-LNA) moiety. Common to all of these LNA variants is an affinity toward 
complementary nucleic acids, which is by far the highest reported for a DNA analog. 
For example, particular all oxy-LNA nonamers have been shown to have melting 
temperatures of 64 °C and 74°C when in complex with complementary DNA or RNA, 
respectively, as oposed to 28 °C for both DNA and RNA for the corresponding DNA 

15 nonamer. Substantial increases in T m are also obtained when LNA monomers are 
used in combination with standard DNA or RNA monomers. For primers and probes, 
depending on where the LNA monomers are included {e.g., the 3' end, the 5'end, or in 
the middle), the T m could be increased considerably. 

In another embodiment, arrays of oligonucleotide probes that are 

20 complementary to target nucleic acid sequence segments from an individual, can be 
used to identify polymorphisms in a KChlPl nucleic acid. For example, in one 
embodiment, an oligonucleotide array can be used. Oligonucleotide arrays typically 
comprise a plurality of different oligonucleotide probes that are coupled to a surface 
of a substrate in different known locations. These oligonucleotide arrays, also 

25 described as "Genechips™," have been generally described in the art, for example, 
U.S. Pat. NO: 5,143,854 and PCT patent publication Nos. WO 90/15070 and 
92/10092. These arrays can generally be produced using mechanical synthesis 
methods or light directed synthesis methods that incorporate a combination of 
photolithographic methods and solid phase oligonucleotide synthesis methods. See 

30 Fodor et al. 9 Science 251:767-777 (1991), Pirrung et aL, U.S. Pat. NO: 5,143,854 (see 
also PCT Application NO: WO 90/15070) and Fodor et al. 9 PCT Publication NO: WO 
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92/10092 and U.S. Pat. NO: 5,424,186, the entire teachings of each of which are 
incorporated by reference herein. Techniques for the synthesis of these arrays using 
mechanical synthesis methods are described in, e.g., U.S. Pat. NO: 5,384,261; the 
entire teachings of which are incorporated by reference herein. In another example, 
5 linear arrays can be utilized. 

Once an oligonucleotide array is prepared, a nucleic acid of interest is 
hybridized with the array and scanned for polymorphisms. Hybridization and 
scanning are generally carried out by methods described herein and also in, e.g., 
published PCT Application Nos. WO 92/10092 and WO 95/1 1995, and U.S. Pat. NO: 

10 5,424, 1 86, the entire teachings of which are incorporated by reference herein. In 
brief, a target nucleic acid sequence that includes one or more previously identified 
polymorphic markers is amplified by well-known amplification techniques, e.g., PCR. 
Typically, this involves the use of primer sequences that are complementary to the 
two strands of the target sequence both upstream and downstream from the 

15 polymorphism. Asymmetric PCR techniques may also be used. Amplified target, 
generally incorporating a label, is then hybridized with the array under appropriate 
conditions. Upon completion of hybridization and washing of the array, the array is 
scanned to determine the position on the array to which the target sequence 
hybridizes. The hybridization data obtained from the scan is typically in the form of 

20 fluorescence intensities as a function of location on the array. 

Although primarily described in terms of a single detection block, e.g., for 
detection of a single polymorphism, arrays can include multiple detection blocks, and 
thus be capable of analyzing multiple, specific polymorphisms. In alternative 
arrangements, it will generally be understood that detection blocks may be grouped 

25 within a single array or in multiple, separate arrays so that varying, optimal conditions 
may be used during the hybridization of the target to the array. For example, it may 
often be desirable to provide for the detection of those polymorphisms that fall within 
G-C rich stretches of a genomic sequence, separately from those falling in A-T rich 
segments. This allows for the separate optimization of hybridization conditions for 

30 each situation. 
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Additional uses of oligonucleotide arrays for polymorphism detection can be 
found, for example, in U.S. Patents Nos. 5,858,659 and 5,837,832, the entire 
teachings of which are incorporated by reference herein. Other methods of nucleic 
acid analysis can be used to detect polymorphisms in a Type II diabetes gene or 
5 variants encoding by a Type II diabetes gene. Representative methods include direct 
manual sequencing (Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995 
(1988); Sanger, F. et al, Proc. Natl. Acad. Sci. USA 74:5463-5467 (1977); Beavis et 
al., U.S. Pat. NO: 5,288,644); automated fluorescent sequencing; single-stranded 
conformation polymorphism assays (SSCP); clamped denaturing gel electrophoresis 

10 (CDGE); denaturing gradient gel electrophoresis (DGGE) (Sheffield, V.C. et al, 
Proc. Natl Acad. Sci. USA 86:232-236 (1989)), mobility shift analysis (Orita, M. et 
al, Proc. Natl Acad. Sci. USA 86:2766-2770 (1989)), restriction enzyme analysis 
(Flavell et al, Cell 15:25 (1978); Geever, et al, Proc. Natl. Acad Sci. USA 78:5081 
(1981)); heteroduplex analysis; chemical mismatch cleavage (CMC) (Cotton et al, 

15 Proc. Natl Acad. Sci. USA 85:4397-4401 (1985)); RNase protection assays (Myers, 
R.M. et al, Science 230:1242 (1985)); use of polypeptides which recognize 
nucleotide mismatches, such as E. coli mutS protein; allele-specific PCR, for 
example. 

In one embodiment of the invention, diagnosis of a disease or condition 
20 associated with a KChlPl nucleic acid {e.g., Type II diabetes) or a susceptibility to a 
disease or condition associated with a KChlPl nucleic acid {e.g., Type II diabetes) 
can also be made by expression analysis by quantitative PCR (kinetic thermal 
cycling). This technique, utilizing TaqMan®, can be used to allow the identification of 
polymorphisms and whether a patient is homozygous or heterozygous. The technique 
25 can assess the presence of an alteration in the expression or composition of the 
polypeptide encoded by a KChlPl nucleic acid or splicing variants encoded by a 
KChlPl nucleic acid. Further, the expression of the variants can be quantified as 
physically or functionally different. 

In another embodiment of the invention, diagnosis of Type II diabetes or a 
30 susceptibility to Type II diabetes 9or a condition associated with a KChlPl gene) can 
be made by examining expression and/or composition of a KChlPl polypeptide, by a 
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variety of methods, including enzyme linked immunosorbent assays (ELIS As), 
Western blots, immunoprecipitations and immunofluorescence. A test sample from 
an individual is assessed for the presence of an alteration in the expression and/or an 
alteration in composition of the polypeptide encoded by a KChlPl nucleic acid, or for 
the presence of a particular variant encoded by a KChlPl nucleic acid. An alteration 
in expression of a polypeptide encoded by a KChlPl nucleic acid can be, for example, 
an alteration in the quantitative polypeptide expression (i.e., the amount of 
polypeptide produced); an alteration in the composition of a polypeptide encoded by a 
KChlPl nucleic acid is an alteration in the qualitative polypeptide expression (e.g., 
expression of an altered KChlPl polypeptide or of a different splicing variant). In a 
preferred embodiment, diagnosis of the disease or condition associated with KChlPl 
nucleic acid or a susceptibility to a disease or condition associated with a KChlPl 
nucleic acid is made by detecting a particular splicing variant encoded by that 
KChlPl nucleic acid, or a particular pattern of splicing variants. 

Both such alterations (quantitative and qualitative) can also be present. The 
term "alteration" in the polypeptide expression or composition, as used herein, refers 
to an alteration in expression or composition in a test sample, as compared with the 
expression or composition of polypeptide by a KChlPl nucleic acid in a control 
sample. A control sample is a sample that corresponds to the test sample (e.g., is from 
the same type of cells), and is from an individual who is not affected by a 
susceptibility to a disease or condition associated with a KChlPl nucleic acid. An 
alteration in the expression or composition of the polypeptide in the test sample, as 
compared with the control sample, is indicative of a susceptibility to a disease or 
condition associated with a KChlPl nucleic acid. Similarly, the presence of one or 
more different splicing variants in the test sample, or the presence of significantly 
different amounts of different splicing variants in the test sample, as compared with 
the control sample, is indicative of a disease or condition associated with a KChlPl 
nucleic acid or a susceptibility to a disease or condition associated with a KChlPl 
nucleic acid. Various means of examining expression or composition of the 
polypeptide encoded by a KChlPl nucleic acid can be used, including: spectroscopy, 
colorimetry, lectrophoresis, isoelectric focusing, and immunoassays (e.g., David et 
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al, U.S. Pat. 4,376,1 10) such as immunoblotting (see also Current Protocols in 
Molecular Biology, particularly Chapter 10). For example, in one embodiment, an 
antibody capable of binding to the polypeptide (e.g., as described above), preferably 
an antibody with a detectable label, can be used. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 
F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or antibody by coupling (i.e., 
physically linking) a detectable substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with another reagent that is directly 
labeled. Examples of indirect labeling include detection of a primary antibody using a 
fluorescently labeled secondary antibody and end-labeling of a DNA probe with 
biotin such that it can be detected with fluorescently labeled streptavidin. 

Western blotting analysis, using an antibody as described above that 
specifically binds to a polypeptide encoded by an altered KChlPl nucleic acid (e.g., a 
KChlPl nucleic acid having one or more alterations as shown in Table 10), or an 
antibody that specifically binds to a polypeptide encoded by a non-altered nucleic 
acid, or an antibody that specifically binds to a particular splicing variant encoded by 
a nucleic acid, can be used to identify the presence in a test sample of a particular 
splicing variant or of a polypeptide encoded by a polymorphic or altered KChlPl 
nucleic acid, or the absence in a test sample of a particular splicing variant or of a 
polypeptide encoded by a non-polymorphic or non-altered nucleic acid. The presence 
of a polypeptide encoded by a polymorphic or altered nucleic acid, or the absence of a 
polypeptide encoded by a non-polymorphic or non-altered nucleic acid, is diagnostic 
for a disease or condition associated with a KChlPl nucleic acid or a susceptibility to 
a disease or condition associated with a KChlPl nucleic acid (e.g., Type II diabetes), 
as is the presence (or absence) of particular splicing variants encoded by the KChlPl 
nucleic acid. 

In one embodiment of this method, the level or amount of polypeptide 
encoded by a KChlPl nucleic acid in a test sample is compared with the level or 
amount of the polypeptide encoded by the KChlPl in a control sample. A level or 
amount of the polypeptide in the test sample that is higher or lower than the level or 
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amount of the polypeptide in the control sample, such that the difference is 
statistically significant, is indicative of an alteration in the expression of the 
polypeptide encoded by the KChlPl nucleic acid, and is diagnostic for a disease or 
condition associated with a KChlPl nucleic acid or a susceptibility to a disease or 
5 condition associated with that KChlPl nucleic acid (e.g., Type II diabetes). 

Alternatively, the composition of the polypeptide encoded by a KChlPl nucleic acid 
in a test sample is compared with the composition of the polypeptide encoded by the 
KChlPl nucleic acid in a control sample (e.g., the presence of different splicing 
variants). A difference in the composition of the polypeptide in the test sample, as 

10 compared with the composition of the polypeptide in the control sample, is diagnostic 
for a disease or condition associated with a KChlPl nucleic acid or a susceptibility to 
a disease or condition associated with that KChlPl nucleic acid (e.g., Type II 
diabetes). In another embodiment, both the level or amount and the composition of 
the polypeptide can be assessed in the test sample and in the control sample. A 

15 difference in the amount or level of the polypeptide in the test sample, compared to 
the control sample; a difference in composition in the test sample, compared to the 
control sample; or both a difference in the amount or level, and a difference in the 
composition, is indicative of a disease or condition associated with a KChlPl nucleic 
acid or a susceptibility to a disease or condition associated with that KChlPl nucleic 

20 acid. 

The invention further pertains to a method for the diagnosis or identification of 
a susceptibility to Type II diabetes in an individual, by identifying an at-risk 
haplotype (e.g., a haplotype comprising a KChlPl nucleic acid). The KChlPl - 
associated haplotypes, e.g., those described in Table 2, Table 4, Table 5 and Table 14, 

25 describe a set of genetic markers ("alleles"). In a certain embodiment, the haplotype 
can comprise one or more alleles, two or more alleles, three or more alleles, four or 
more alleles, or five or more alleles. The genetic markers are particular "alleles" at 
"polymorphic sites" associated with KChlPl . A nucleotide position at which more 
than one sequence is possible in a population (either a natural population or a 

30 synthetic population, e.g., a library of synthetic molecules), is referred to herein as a 
"polymorphic site". Where a polymorphic site is a single nucleotide in length, the site 



2345.2049-005 



-68- 

is referred to as a single nucleotide polymorphism ("SNP"). For example, if at a 
particular chromosomal location, one member of a population has an adenine and 
another member of the population has a thymine at the same position, then this 
position is a polymorphic site, and, more specifically, the polymorphic site is a SNP. 
5 Polymorphic sites can allow for differences in sequences based on substitutions, 

insertions or deletions. Each version of the sequence with respect to the polymorphic 
site is referred to herein as an "allele" of the polymorphic site. Thus, in the previous 
example, the SNP allows for both an adenine allele and a thymine allele. 

Typically, a reference sequence is referred to for a particular sequence. 

10 Alleles that differ from the reference are referred to as "variant" alleles. For example, 
the reference KChlPl sequence is described herein by SEQ ID NO: 1. The term, 
"variant KChlPl", as used herein, refers to a sequence that differs from SEQ ID NO: 
1, but is otherwise substantially similar. The genetic markers that make up the 
haplotypes described herein are KChlPl variants. The variants of KChlPl that are 

15 used to determine the haplotypes disclosed herein of the present invention are 
associated with Type II diabetes or a susceptibility to Type II diabetes. 

Additional variants can include changes that affect a polypeptide, e.g., the 
KChlPl polypeptide. These sequence differences, when compared to a reference 
nucleotide sequence, can include the insertion or deletion of a single nucleotide, or of 

20 more than one nucleotide, resulting in a frame shift; the change of at least one 

nucleotide, resulting in a change in the encoded amino acid; the change of at least one 
nucleotide, resulting in the generation of a premature stop codon; the deletion of 
several nucleotides, resulting in a deletion of one or more amino acids encoded by the 
nucleotides; the insertion of one or several nucleotides, such as by unequal 

25 recombination or gene conversion, resulting in an interruption of the coding sequence 
of a reading frame; duplication of all or a part of a sequence; transposition; or a 
rearrangement of a nucleotide sequence, as described in detail above. Such sequence 
changes alter the polypeptide encoded by a KChlPl nucleic acid. For example, if the 
change in the nucleic acid sequence causes a frame shift, the frame shift can result in 

30 a change in the encoded amino acids, and/or can result in the generation of a 

premature stop codon, causing generation of a truncated polypeptide. Alternatively, a 
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polymorphism associated with Type II diabetes or a susceptibility to Type II diabetes 
can be a synonymous change in one or more nucleotides (i.e., a change that does not 
result in a change in the amino acid sequence). Such a polymorphism can, for 
example, alter splice sites, affect the stability or transport of mRNA, or otherwise 
5 affect the transcription or translation of the polypeptide. The polypeptide encoded by 
the reference nucleotide sequence is the "reference" polypeptide with a particular 
reference amino acid sequence, and polypeptides encoded by variant alleles are 
referred to as "variant" polypeptides with variant amino acid sequences. 

Haplotypes are a combination of genetic markers, e.g., particular alleles at 

10 polymorphic sites. The haplotypes described herein, e.g., having markers such as 
those shown in Table 10, Table 11, Table 12 or Table 13, are found more frequently 
in individuals with Type II diabetes than in individuals without Type II diabetes. 
Therefore, these haplotypes have predictive value for detecting Type II diabetes or a 
susceptibility to Type II diabetes in an individual. The haplotypes described herein 

15 are a combination of various genetic markers, e.g., SNPs and microsatellites. 

Therefore, detecting haplotypes can be accomplished by methods known in the art for 
detecting sequences at polymorphic sites, such as the methods described above. 

HAPLOTYPE SCREENING 

20 In the methods for the diagnosis and identification of susceptibility to Type II 

diabetes or Type II diabetes in an individual, an at-risk haplotype is identified. In one 
embodiment, the at-risk haplotype is one which confers a significant risk of Type II 
diabetes. In one embodiment, significance associated with a haplotype is measured 
by an odds ratio. In a further embodiment, the significance is measured by a 

25 percentage. In one embodiment, a significant risk is measured as an odds ratio of at 
least about 1.2, including by not limited to: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, and 1.9. In 
a further embodiment, an odds ratio of at least 1.2 is significant. In a further 
embodiment, an odds ratio of at least about 1 .5 is significant. In a further 
embodiment, a significant increase in risk is at least about 1.7 is significant. In a 

30 further embodiment, a significant increase in risk is at least about 20%, including but 
not limited to about 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 
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80%, 85%, 90%, 95% and 98%. In a further embodiment, a significant increase in 
risk is at least about 50%. It is understood however, that identifying whether a risk is 
medically significant may also depend on a variety of factors, including the specific 
disease, the haplotype, and often, environmental factors. 
5 The invention also pertains to methods of diagnosing Type II diabetes or a 

susceptibility to Type II diabetes in an individual, comprising screening for an at-risk 
haplotype in, or comprising portions of, the KChlPlgene, where the haplotype is 
more frequently present in an individual susceptible to Type II diabetes (affected), 
compared to the frequency of its presence in a healthy individual (control), and 

10 wherein the presence of the haplotype is indicative of Type II diabetes or 
susceptibility to Type II diabetes. Standard techniques for genotyping for the 
presence of SNPs and/or microsatellite markers can be used, such as fluorescent based 
techniques (Chen, et ai, Genome Res. 9, 492 (1999)), PCR, LCR, Nested PCR and 
other techniques for nucleic acid amplification. In a preferred embodiment, the 

15 method comprises assessing in an individual the presence or frequency of SNPs 

and/or microsatellites in, comprising portions of, the KChlPlgene, wherein an excess 
or higher frequency of the SNPs and/or microsatellites compared to a healthy control 
individual is indicative that the individual has Type II diabetes or is susceptible to 
Type II diabetes. See, for example, Tables 6, 7, 9, 1 1, 13 and 14 (below) for SNPs 

20 and markers that can form haplotypes that can be used as screening tools. These 
markers and SNPs can be used to design diagnostic tests for determining Type II 
diabetes or a susceptibility to Type II diabetes. For example, an at-risk haplotype can 
include microsatellite markers and/or SNPs such as those set forth in Table 10, Table 
11, Table 12 Table 13 and/ or Table 14. The presence of the haplotype is diagnostic 

25 of Type II diabetes or of a susceptibility to Type II diabetes. Haplotype analysis 
involves defining a candidate susceptibility locus using LOD scores. The defined 
regions are then ultra-fine mapped with microsatellite markers with an average 
spacing between markers of less than lOOkb. All usable microsatellite markers that 
found in public databases and mapped within that region can be used. In addition, 

30 microsatellite markers identified within the deCODE genetics sequence assembly of 
the human genome can be used. 
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The frequencies of haplotypes in the patient and the control groups using an 
expectation-maximization algorithm can be estimated (Dempster A. et al. 9 1977. J. R. 
Stat. Soc. B 9 39:1-389). An implementation of this algorithm that can handle missing 
genotypes and uncertainty with the phase can be used. Under the null hypothesis, the 
5 patients and the controls are assumed to have identical frequencies. Using a 

likelihood approach, an alternative hypothesis where a candidate at-risk-haplotype, 
which can include the markers described herein, is allowed to have a higher frequency 
in patients than controls, while the ratios of the frequencies of other haplotypes are 
assumed to be the same in both groups is tested. Likelihoods are maximized 

10 separately under both hypotheses and a corresponding 1 -df likelihood ratio statistics is 
used to evaluate the statistic significance. 

To look for at-risk-haplotypes in the 1-lod drop, for example, association of all 
possible combinations of genotyped markers is studied, provided those markers span a 
practical region. The combined patient and control groups can be randomly divided 

15 into two sets, equal in size to the original group of patients and controls. The 
haplotype analysis is then repeated and the most significant p-value registered is 
determined. This randomization scheme can be repeated, for example, over 100 times 
to construct an empirical distribution of p-values. 

The at-risk haplotypes identified in Table 2 (haplotypes identified as Al, A2, 

20 A3, A4, A5, A6, Bl, B2, B3, B4 and B5), Table 4 (haplotypes identified as Dl and 
D2), Table 5 (haplotypes identified as D2, D3, D4, D5 and D6) or Table 14 
(haplotypes identified as Hap E and Hap E') are associated with Type II diabetes or a 
susceptibility to Type II diabetes. In certain embodiments, a haplotype associated 
with Type II diabetes or a susceptibility to Type II diabetes comprises markers 

25 DG5S879, DG5S881, D5S2075, DG5S883 and DG5S38 at the 5q35 locus; or 

DG5S1058 and DG5S37 at the 5q35 locus; or DG5S1058, DG5S37 and DG5S101 at 
the 5q35 locus; or DG5S881, DG5S1058, D5S2075, DG5S883 and DG5S38 at the 
5q35 locus; or DG5S879, DG5S1058 and DG5S37; or DG5S881, D5S2075, 
DG5S883 and DG5S38 at the 5q35 locus; DG5S953, DG5S955, DG5S13 and 

30 DG5S959 at the 5q35 locus; or DG5S888 and DG5S953 at the 5q35 locus; or 

DG5S953, DG5S955 and DG5S124 at the 5q35 locus; or DG5S888, DG5S44 and 
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DG5S953 at the 5q35 locus; or DG5S953, DG5S955, DG5S13, DG5S123, and 
DG5S959 at the 5q35 locus. The presence of the haplotype is diagnostic of Type II 
diabetes or of a susceptibility to Type II diabetes. Also described herein is a 
haplotype associated with Type II diabetes or a susceptibility to Type II diabetes 
5 comprising markers DG5S 13, KCP_1 1 52, and D5S625 at the 5q35 locus; the 
presence of the haplotype is diagnostic of Type II diabetes or of a susceptibility to 
Type II diabetes. In one particular embodiment, the presence of the -4, 1, 0 haplotype 
at DG5S13, KCP_1 152, and D5S625 is diagnostic of Type II diabetes or of a 
susceptibility to Type II diabetes. In another embodiment, a haplotype associated 

10 with Type II diabetes or a susceptibility to Type II diabetes in an individual, 

comprises markers DG5S124, KCPJ 152, KCPJ649, KPC J976 and KPC-16152 at 
the 5q35 locus. In one particular embodiment, the presence of the 0, 1, 1, 3 and 0 
haplotype at DG5S124, KCPJ 152, KCP_2649, KPC_4976 and KPC-16152 is 
diagnostic of Type II diabetes or of a susceptibility to Type II diabetes. In another 

15 embodiment, a haplotype associated with Type II diabetes or a susceptibility to Type 
II diabetes in an individual, comprises markers KCP_1 73982, KCPJ 5400, and 
KCP_18069. In one particular embodiment, the presence of the 0, 1, 1 haplotype at 
KCPJ 73982, KCPJ 5400, and KCPJ 8069 is diagnostic of Type II diabetes or of a 
susceptibility to Type II diabetes. 

20 In additional embodiments, a haplotype associated with Type II diabetes or a 

susceptibility to Type II diabetes comprises markers DG5S124, KCPJ 152, 
KCPJ649, KCPJ976, and KCPJ6152 at the 5q35 locus, as well as one of the 
following 3 markers: KCPJ97678, KCPJ97775, and KCP_202795 at the 5q35 
locus; the presence of the haplotype is diagnostic of Type II diabetes or of a 

25 susceptibility to Type II diabetes. In particular embodiments, the presence of the 0, 3, 
1, 1, 3, 0 haplotype at DG5S124, KCPJ97678, KCPJ 152, KCPJ649, KCPJ976, 
and KCPJ6152; the presence of the 0, 3, 1, 1, 3, 0 haplotype at DG5S124, 
KCPJ 97775, KCPJ 152, KCPJ649, KCP_4976, and KCPJ 61 52; or the presence 
of the 0, 1, 1, 1, 3, 0 haplotype at DG5S124, KCPJ02795, KCPJ 152, KCPJ649, 

30 KCP J976, and KCPJ 6 1 52; is diagnostic of Type II diabetes or of a susceptibility to 
Type II diabetes. 
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In additional embodiments, a haplotype associated with Type II diabetes or a 
susceptibility to Type II diabetes comprises markers rs 1032856, KCP_RS888934, 
KCP_93545, KCPJ 02882, 169234, KCP_1 86048 and KCPJ6152, as well as 
markers rsl032856, KCP_RS888934, KCP_93545, KCPJ 02882, 169234, 
5 KCPJ 86048, KCP_197775 and KCPJ6152 at the 5q35 locus; the presence of the 
haplotype is diagnostic of Type II diabetes or of a susceptibility to Type II diabetes. 
In particular embodiments, the presence of the G, G, T, C, G, G, A haplotype at 
rsl032856, KCP_RS888934, KCP_93545, KCP_102882, 169234, KCPJ 86048 and 
KCPJ6152, or the presence of the G, G, T, C, G, G, C, A haplotype at rs 1032856, 
10 KCP_RS888934, KCP_93545, KCPJ02882, 169234, KCPJ 86048, KCPJ97775 
and KCPJ6152 is diagnostic of Type II diabetes or of a susceptibility to Type II 
diabetes. 

Kits (eg., reagent kits) useful in the methods of diagnosis comprise 
components useful in any of the methods described herein, including for example, 

15 hybridization probes or primers as described herein (e.g., labeled probes or primers), 
reagents for detection of labeled molecules, restriction enzymes (e.g., for RFLP 
analysis), allele-specific oligonucleotides, antibodies which bind to altered or to non- 
altered (native) KChlPl polypeptide, means for amplification of nucleic acids 
comprising a KChlPl nucleic acid, or means for analyzing the nucleic acid sequence 

20 of a KChlPl nucleic acid or for analyzing the amino acid sequence of a KChlPl 
polypeptide as described herein, etc. In one embodiment, the kit for diagnosing a 
Type II diabetes or a susceptibility to Type II diabetes can comprise primers for 
nucleic acid amplification of a region in the KChlPl nucleic acid comprising an at- 
risk haplotype that is more frequently present in an individual having Type II diabetes 

25 or who is susceptible to Type II diabetes. The primers can be designed using portions 
of the nucleic acids flanking SNPs that are indicative of Type II diabetes. In a certain 
embodiment, the primers are designed to amplify regions of the KChlPl gene 
associated with an at-risk haplotype for Type II diabetes, as shown in Table 10 and 
13, or more particularly the haplotypes described in Tables 2, 4, 5 and 14. 



30 
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SCREENING ASSAYS AND AGENTS IDENTIFIED THEREBY 

The invention provides methods (also referred to herein as "screening assays") 
for identifying the presence of a nucleotide that hybridizes to a nucleic acid of the 
invention, as well as for identifying the presence of a polypeptide encoded by a 
nucleic acid of the invention. In one embodiment, the presence (or absence) of a 
nucleic acid molecule of interest (e.g., a nucleic acid that has significant homology 
with a nucleic acid of the invention) in a sample can be assessed by contacting the 
sample with a nucleic acid comprising a nucleic acid of the invention (e.g., a nucleic 
acid having the sequence of one of SEQ ID NOs: 1, 1 14-258, or the complement 
thereof, or a nucleic acid encoding an amino acid having the sequence of one of SEQ 
ID NOs: 2, or a fragment or variant of such nucleic acids), under stringent conditions 
as described above, and then assessing the sample for the presence (or absence) of 
hybridization. In one embodiment, high stringency conditions are conditions 
appropriate for selective hybridization. In another embodiment, a sample containing 
the nucleic acid molecule of interest is contacted with a nucleic acid containing a 
contiguous nucleotide sequence (e.g., a primer or a probe as described above) that is 
at least partially complementary to a part of the nucleic acid molecule of interest (e.g., 
a KChlPl nucleic acid), and the contacted sample is assessed for the presence or 
absence of hybridization. In another embodiment, the nucleic acid containing a 
contiguous nucleotide sequence is completely complementary to a part of the nucleic 
acid molecule of interest. 

In any of these embodiments, all or a portion of the nucleic acid of interest can 
be subjected to amplification prior to performing the hybridization. 

In another embodiment, the presence (or absence) of a polypeptide of interest, 
such as a polypeptide of the invention or a fragment or variant thereof, in a sample 
can be assessed by contacting the sample with an antibody that specifically hybridizes 
to the polypeptide of interest (e.g., an antibody such as those described above), and 
then assessing the sample for the presence (or absence) of binding of the antibody to 
the polypeptide of interest. 

In another embodiment, the invention provides methods for identifying agents 
(e.g., fusion proteins, polypeptides, peptidomimetics, prodrugs, receptors, binding 
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agents, antibodies, small molecules or other drugs, or ribozymes) which alter {e.g., 
increase or decrease) the activity of the polypeptides described herein, or which 
otherwise interact with the polypeptides herein. For example, such agents can be 
agents which bind to polypeptides described herein {e.g., KChlPl binding agents); 
5 which have a stimulatory or inhibitory effect on, for example, activity of polypeptides 
of the invention; or which change {e.g., enhance or inhibit) the ability of the 
polypeptides of the invention to interact with KChlPl binding agents {e.g., receptors 
or other binding agents); or which alter posttranslational processing of the KChlPl 
polypeptide {e.g., agents that alter proteolytic processing to direct the polypeptide 
10 from where it is normally synthesized to another location in the cell, such as the cell 
surface; agents that alter proteolytic processing such that more polypeptide is released 
from the cell, etc. 

In one embodiment, the invention provides assays for screening candidate or 
test agents that bind to or modulate the activity of polypeptides described herein (or 

15 biologically active portion(s) thereof), as well as agents identifiable by the assays. 
Test agents can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the 'one-bead one-compound' library method; and synthetic library 

20 methods using affinity chromatography selection. The biological library approach is 
limited to polypeptide libraries, while the other four approaches are applicable to 
polypeptide, non-peptide oligomer or small molecule libraries of compounds (Lam, 
K.S., Anticancer Drug Des. 12:145 (1997)). 

In one embodiment, to identify agents which alter the activity of a KChlPl 

25 polypeptide, a cell, cell lysate, or solution containing or expressing a KChlPl 
polypeptide, or another splicing variant encoded by a KChlPl gene (such as 
comprising a SNP as shown in Table 10 and/or 3), or a fragment or derivative thereof 
(as described above), can be contacted with an agent to be tested; alternatively, the 
polypeptide can be contacted directly with the agent to be tested. The level (amount) 

30 of KChlPl activity is assessed {e.g., the level (amount) of KChlPl activity is 

measured, either directly or indirectly), and is compared with the level of activity in a 
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control {i.e., the level of activity of the KChlPl polypeptide or active fragment or 
derivative thereof in the absence of the agent to be tested). If the level of the activity 
in the presence of the agent differs, by an amount that is statistically significant, from 
the level of the activity in the absence of the agent, then the agent is an agent that 
5 alters the activity of a KChlPl polypeptide. An increase in the level of KChlPl 
activity relative to a control, indicates that the agent is an agent that enhances (is an 
agonist of) KChlPl activity. Similarly, a decrease in the level of KChlPl activity 
relative to a control, indicates that the agent is an agent that inhibits (is an antagonist 
of) KChlPl activity. In another embodiment, the level of activity of a KChlPl 

10 polypeptide or derivative or fragment thereof in the presence of the agent to be tested, 
is compared with a control level that has previously been established. A level of the 
activity in the presence of the agent that differs from the control level by an amount 
that is statistically significant indicates that the agent alters KChlPl activity. 

The present invention also relates to an assay for identifying agents which 

15 alter the expression of a KChlPl nucleic acid (e.g., antisense nucleic acids, fusion 
proteins, polypeptides, peptidomimetics, prodrugs, receptors, binding agents, 
antibodies, small molecules or other drugs, or ribozymes) which alter (e.g., increase 
or decrease) expression (e.g., transcription or translation) of the gene or which 
otherwise interact with the nucleic acids described herein, as well as agents 

20 identifiable by the assays. For example, a solution containing a nucleic acid encoding 
a KChlPl polypeptide (e.g., a KChlPl gene or nucleic acid) can be contacted with an 
agent to be tested. The solution can comprise, for example, cells containing the 
nucleic acid or cell lysate containing the nucleic acid; alternatively, the solution can 
be another solution that comprises elements necessary for transcription/translation of 

25 the nucleic acid. Cells not suspended in solution can also be employed, if desired. 
The level and/or pattern of KChlPl expression (e.g., the level and/or pattern of 
mRNA or of protein expressed, such as the level and/or pattern of different splicing 
variants) is assessed, and is compared with the level and/or pattern of expression in a 
control (i.e., the level and/or pattern of the KChlPl expression in the absence of the 

30 agent to be tested). If the level and/or pattern in the presence of the agent differs, by 
an amount or in a manner that is statistically significant, from the level and/or pattern 
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in the absence of the agent, then the agent is an agent that alters the expression of a 
Type II diabetes gene. Enhancement of KChlPl expression indicates that the agent is 
an agonist of KChlPl activity. Similarly, inhibition of KChlPl expression indicates 
that the agent is an antagonist of KChlPl activity. In another embodiment, the level 
5 and/or pattern of KChlPl polypeptide(s) (e.g. , different splicing variants) in the 

presence of the agent to be tested, is compared with a control level and/or pattern that 
have previously been established. A level and/or pattern in the presence of the agent 
that differs from the control level and/or pattern by an amount or in a manner that is 
statistically significant indicates that the agent alters KChlPl expression. 

10 In another embodiment of the invention, agents which alter the expression of a 

KChlPl nucleic acid or which otherwise interact with the nucleic acids described 
herein, can be identified using a cell, cell lysate, or solution containing a nucleic acid 
encoding the promoter region of the KChlPl gene or nucleic acid operably linked to a 
reporter gene. After contact with an agent to be tested, the level of expression of the 

15 reporter gene (e.g., the level of mRNA or of protein expressed) is assessed, and is 
compared with the level of expression in a control (i.e., the level of the expression of 
the reporter gene in the absence of the agent to be tested). If the level in the presence 
of the agent differs, by an amount or in a manner that is statistically significant, from 
the level in the absence of the agent, then the agent is an agent that alters the 

20 expression of the KChlPl, as indicated by its ability to alter expression of a gene that 
is operably linked to the KChlPl gene promoter. Enhancement of the expression of 
the reporter indicates that the agent is an agonist of KChlPl activity. Similarly, 
inhibition of the expression of the reporter indicates that the agent is an antagonist of 
KChlPl activity. In another embodiment, the level of expression of the reporter in the 

25 presence of the agent to be tested is compared with a control level that has previously 
been established. A level in the presence of the agent that differs from the control 
level by an amount or in a manner that is statistically significant indicates that the 
agent alters expression. 

Agents which alter the amounts of different splicing variants encoded by a 

30 KChlPl nucleic acid (e.g., an agent which enhances activity of a first splicing variant, 
and which inhibits activity of a second splicing variant), as well as agents which are 
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agonists of activity of a first splicing variant and antagonists of activity of a second 
splicing variant, can easily be identified using these methods described above. 

In other embodiments of the invention, assays can be used to assess the impact 
of a test agent on the activity of a polypeptide in relation to a KChlPl binding agent. 
5 For example, a cell that expresses a compound that interacts with a KChlPl 
polypeptide (herein referred to as a "KChlPl binding agent", which can be a 
polypeptide or other molecule that interacts with a KChlPl polypeptide, such as a 
receptor) is contacted with a KChlPl in the presence of a test agent, and the ability of 
the test agent to alter the interaction between the KChlPl and the KChlPl binding 

10 agent is determined. Alternatively, a cell lysate or a solution containing the KChlPl 
binding agent, can be used. An agent which binds to the KChlPl or the KChlPl 
binding agent can alter the interaction by interfering with, or enhancing the ability of 
the KChlPl to bind to, associate with, or otherwise interact with the KChlPl binding 
agent. Determining the ability of the test agent to bind to a KChlPl nucleic acid or a 

15 KChlPl binding agent can be accomplished, for example, by coupling the test agent 
with a radioisotope or enzymatic label such that binding of the test agent to the 
polypeptide can be determined by detecting the labeled with 125 1, 35 S, 14 C or 3 H, either 
directly or indirectly, and the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. Alternatively, test agents can be 

20 enzymatically labeled with, for example, horseradish peroxidase, alkaline 

phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. It is also within the scope of this 
invention to determine the ability of a test agent to interact with the polypeptide 
without the labeling of any of the interactants. For example, a microphysiometer can 

25 be used to detect the interaction of a test agent with a KChlPl polypeptide or a 
KChlPl binding agent without the labeling of either the test agent, KChlPl 
polypeptide, or the KChlPl binding agent. McConnell, H.M. et al. 9 Science 
257:1906-1912 (1992). As used herein, a "microphysiometer" (e.g., Cytosensor™) is 
an analytical instrument that measures the rate at which a cell acidifies its 

30 environment using a light-addressable potentiometric sensor (LAPS). Changes in this 
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acidification rate can be used as an indicator of the interaction between ligand and 
polypeptide. 

Thus, these receptors can be used to screen for compounds that are agonists or 
antagonists, for use in treating a susceptibility to a disease or condition associated 
5 with a KChlPl gene or nucleic acid, or for studying a susceptibility to a disease or 
condition associated with a KChlPl (e.g., Type II diabetes). Drugs could be designed 
to regulate KChlPl activation that in turn can be used to regulate signaling pathways 
and transcription events of genes downstream. 

In another embodiment of the invention, assays can be used to identify 

10 polypeptides that interact with one or more KChlPl polypeptides, as described herein. 
For example, a yeast two-hybrid system such as that described by Fields and Song 
(Fields, S. and Song, O., Nature 340:245-246 (1989)) can be used to identify 
polypeptides that interact with one or more KChlPl polypeptides. In such a yeast 
two-hybrid system, vectors are constructed based on the flexibility of a transcription 

15 factor that has two functional domains (a DNA binding domain and a transcription 
activation domain). If the two domains are separated but fused to two different 
proteins that interact with one another, transcriptional activation can be achieved, and 
transcription of specific markers (e.g., nutritional markers such as His and Ade, or 
color markers such as lacZ) can be used to identify the presence of interaction and 

20 transcriptional activation. For example, in the methods of the invention, a first vector 
is used which includes a nucleic acid encoding a DNA binding domain and also a 
KChlPl polypeptide, splicing variant, or fragment or derivative thereof, and a second 
vector is used which includes a nucleic acid encoding a transcription activation 
domain and also a nucleic acid encoding a polypeptide which potentially may interact 

25 with the KChIP 1 polypeptide, splicing variant, or fragment or derivative thereof (e.g. , 
a KChlPl polypeptide binding agent or receptor). Incubation of yeast containing the 
first vector and the second vector under appropriate conditions (e.g., mating 
conditions such as used in the Matchmaker™ system from Clontech (Palo Alto, 
California, USA)) allows identification of colonies that express the markers of 

30 interest. These colonies can be examined to identify the polypeptide(s) that interact 
with the KChlPl polypeptide or fragment or derivative thereof. Such polypeptides 
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may be useful as agents that alter the activity of expression of a KChlPl polypeptide, 
as described above. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either the KChlPl gene or nucleic acid, 
5 the KChlPl polypeptide, the KChlPl binding agent, or other components of the assay 
on a solid support, in order to facilitate separation of complexed from uncomplexed 
forms of one or both of the polypeptides, as well as to accommodate automation of 
the assay. Binding of a test agent to the polypeptide, or interaction of the polypeptide 
with a binding agent in the presence and absence of a test agent, can be accomplished 

10 in any vessel suitable for containing the reactants. Examples of such vessels include 
microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 
protein (e.g., a glutathione-S-transferase fusion protein) can be provided which adds a 
domain that allows a KChlPl nucleic acid, KChlPl polypeptide, or a KChlPl binding 
agent to be bound to a matrix or other solid support. 

15 In another embodiment, modulators of expression of nucleic acid molecules of 

the invention are identified in a method wherein a cell, cell lysate, or solution 
containing a KChlPl nucleic acid is contacted with a test agent and the expression of 
appropriate mRNA or polypeptide (e.g., splicing variant(s)) in the cell, cell lysate, or 
solution, is determined. The level of expression of appropriate mRNA or 

20 polypeptide(s) in the presence of the test agent is compared to the level of expression 
of mRNA or polypeptide(s) in the absence of the test agent. The test agent can then 
be identified as a modulator of expression based on this comparison. For example, 
when expression of mRNA or polypeptide is greater (statistically significantly 
greater) in the presence of the test agent than in its absence, the test agent is identified 

25 as a stimulator or enhancer of the mRNA or polypeptide expression. Alternatively, 
when expression of the mRNA or polypeptide is less (statistically significantly less) in 
the presence of the test agent than in its absence, the test agent is identified as an 
inhibitor of the mRNA or polypeptide expression. The level of mRNA or polypeptide 
expression in the cells can be determined by methods described herein for detecting 

30 mRNA or polypeptide. 
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This invention further pertains to novel agents identified by the above- 
described screening assays. Accordingly, it is within the scope of this invention to 
further use an agent identified as described herein in an appropriate animal model. 
For example, an agent identified as described herein (e.g., a test agent that is a 
5 modulating agent, an antisense nucleic acid molecule, a specific antibody, or a 

polypeptide-binding agent) can be used in an animal model to determine the efficacy, 
toxicity, or side effects of treatment with such an agent. Alternatively, an agent 
identified as described herein can be used in an animal model to determine the 
mechanism of action of such an agent. 

10 Furthermore, this invention pertains to uses of novel agents identified by the 

above-described screening assays for treatments as described herein. In addition, an 
agent identified as described herein can be used to alter activity of a polypeptide 
encoded by a KChlPl nucleic acid, or to alter expression of a KChlPl nucleic acid, 
by contacting the polypeptide or the nucleic acid (or contacting a cell comprising the 

15 polypeptide or the nucleic acid) with the agent identified as described herein. 

PHARMACEUTICAL COMPOSITIONS 

The present invention also pertains to pharmaceutical compositions 
comprising nucleic acids described herein, particularly nucleotides encoding the 

20 polypeptides described herein (e.g., a KChlPl polypeptide); comprising polypeptides 
described herein and/or comprising other splicing variants encoded by a KChlPl 
nucleic acid; and/or an agent that alters (e.g., enhances or inhibits) KChlPl nucleic 
acid expression or KChlPl polypeptide activity as described herein. For instance, a 
polypeptide, protein (e.g., a KChlPl nucleic acid receptor), an agent that alters 

25 KChlPl nucleic acid expression, or a KChlPl binding agent or binding partner, 

fragment, fusion protein or pro-drug thereof, or a nucleotide or nucleic acid construct 
(vector) comprising a nucleotide of the present invention, or an agent that alters 
KChlPl polypeptide activity, can be formulated with a physiologically acceptable 
carrier or excipient to prepare a pharmaceutical composition. The carrier and 

30 composition can be sterile. The formulation should suit the mode of administration. 
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Suitable pharmaceutically acceptable carriers include but are not limited to 
water, salt solutions (e.g., NaCl), saline, buffered saline, alcohols, glycerol, ethanol, 
gum arabic, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin, 
carbohydrates such as lactose, amylose or starch, dextrose, magnesium stearate, talc, 
silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxymethylcellulose, 
polyvinyl pyrolidone, etc., as well as combinations thereof. The pharmaceutical 
preparations can, if desired, be mixed with auxiliary agents, e.g., lubricants, 
preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic 
pressure, buffers, coloring, flavoring and/or aromatic substances and the like which do 
not deleteriously react with the active agents. 

The composition, if desired, can also contain minor amounts of wetting or 
emulsifying agents, or pH buffering agents. The composition can be a liquid solution, 
suspension, emulsion, tablet, pill, capsule, sustained release formulation, or powder. 
The composition can be formulated as a suppository, with traditional binders and 
carriers such as triglycerides. Oral formulation can include standard carriers such as 
pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, polyvinyl 
pyrollidone, sodium saccharine, cellulose, magnesium carbonate, etc. 

Methods of introduction of these compositions include, but are not limited to, 
intradermal, intramuscular, intraperitoneal, intraocular, intravenous, subcutaneous, 
topical, oral and intranasal. Other suitable methods of introduction can also include 
gene therapy (as described below), rechargeable or biodegradable devices, particle 
acceleration devises ("gene guns") and slow release polymeric devices. The 
pharmaceutical compositions of this invention can also be administered as part of a 
combinatorial therapy with other agents. 

The composition can be formulated in accordance with the routine procedures 
as a pharmaceutical composition adapted for administration to human beings. For 
example, compositions for intravenous administration typically are solutions in sterile 
isotonic aqueous buffer. Where necessary, the composition may also include a 
solubilizing agent and a local anesthetic to ease pain at the site of the injection. 
Generally, the ingredients are supplied either separately or mixed together in unit 
dosage form, for example, as a dry lyophilized powder or water free concentrate in a 
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hermetically sealed container such as an ampule or sachette indicating the quantity of 
active agent. Where the composition is to be administered by infusion, it can be 
dispensed with an infusion bottle containing sterile pharmaceutical grade water, saline 
or dextrose/water. Where the composition is administered by injection, an ampule of 
sterile water for injection or saline can be provided so that the ingredients may be 
mixed prior to administration. 

For topical application, nonsprayable forms, viscous to semi-solid or solid 
forms comprising a carrier compatible with topical application and having a dynamic 
viscosity preferably greater than water, can be employed. Suitable formulations 
include but are not limited to solutions, suspensions, emulsions, creams, ointments, 
powders, enemas, lotions, sols, liniments, salves, aerosols, etc., which are, if desired, 
sterilized or mixed with auxiliary agents, e.g., preservatives, stabilizers, wetting 
agents, buffers or salts for influencing osmotic pressure, etc. The agent may be 
incorporated into a cosmetic formulation. For topical application, also suitable are 
sprayable aerosol preparations wherein the active ingredient, preferably in 
combination with a solid or liquid inert carrier material, is packaged in a squeeze 
bottle or in admixture with a pressurized volatile, normally gaseous propellant, e.g., 
pressurized air. 

Agents described herein can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amino groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and 
those formed with free carboxyl groups such as those derived from sodium, 
potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 2- 
ethylamino ethanol, histidine, procaine, etc. 

The agents are administered in a therapeutically effective amount. The 
amount of agents which will be therapeutically effective in the treatment of a 
particular disorder or condition will depend on the nature of the disorder or condition, 
and can be determined by standard clinical techniques. In addition, in vitro or in vivo 
assays may optionally be employed to help identify optimal dosage ranges. The 
precise dose to be employed in the formulation will also depend on the route of 
administration, and the seriousness of the symptoms, and should be decided according 
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to the judgment of a practitioner and each patient's circumstances. Effective doses 
may be extrapolated from dose-response curves derived from in vitro or animal model 
test systems. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Optionally associated with such containers) can be a 
notice in the form prescribed by a governmental agency regulating the manufacture, 
use or sale of pharmaceuticals or biological products, which notice reflects approval 
by the agency of manufacture, use of sale for human administration. The pack or kit 
can be labeled with information regarding mode of administration, sequence of drug 
administration (e.g., separately, sequentially or concurrently), or the like. The pack or 
kit may also include means for reminding the patient to take the therapy. The pack or 
kit can be a single unit dosage of the combination therapy or it can be a plurality of 
unit dosages. In particular, the agents can be separated, mixed together in any 
combination, present in a single vial or tablet. Agents assembled in a blister pack or 
other dispensing means is preferred. For the purpose of this invention, unit dosage is 
intended to mean a dosage that is dependent on the individual pharmacodynamics of 
each agent and administered in FDA approved dosages in standard time courses. 

METHODS OF THERAPY 

The present invention also pertains to methods of treatment (prophylactic 
and/or therapeutic) for certain diseases and conditions associated with KChlPl . In 
particular, the invention relates to methods of treatment for Type II diabetes or a 
susceptibility to Type II diabetes, using a Type II diabetes therapeutic agent. A "Type 
II diabetes therapeutic agent" is an agent that alters {e.g., enhances or inhibits) 
KChlPl polypeptide activity and/or KChlPl nucleic acid expression, as described 
herein (e.g., a Type II diabetes nucleic acid agonist or antagonist). In certain 
embodiments, the Type II diabetes therapeutic agent alters activity and/or nucleic acid 
expression of KChlPl . 

Type II diabetes therapeutic agents can alter KChlPl polypeptide activity or 
nucleic acid expression by a variety of means, such as, for example, by providing 
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additional KChlPl polypeptide or by upregulating the transcription or translation of 
the KChlPl nucleic acid; by altering posttranslational processing of the KChlPl 
polypeptide; by altering transcription of KChlP l splicing variants; or by interfering 
with KChlPl polypeptide activity (e.g., by binding to a KChlPl polypeptide), or by 
binding to another polypeptide that interacts with KChlPl, by altering (e.g., 
downregulating) the expression, transcription or translation of a KChlPl nucleic acid, 
or by altering (e.g., agonizing or antagonizing) activity. 

Representative Type II diabetes therapeutic agents include the following: 

nucleic acids or fragments or derivatives thereof described herein, particularly 
nucleotides encoding the polypeptides described herein and vectors 
comprising such nucleic acids (e.g., a gene, cDNA, and/or mRNA, such as a 
nucleic acid encoding a KChlPl polypeptide or active fragment or derivative 
thereof, or an oligonucleotide; or a complement thereof, or fragments or 
derivatives thereof, and/or other splicing variants encoded by a Type II 
diabetes nucleic acid, or fragments or derivatives thereof); 

polypeptides described herein and/ or splicing variants encoded by the 
KChlPl nucleic acid or fragments or derivatives thereof; 

other polypeptides (e.g., KChlPl receptors); KChlPl binding agents; or agents 
that affect (e.g., increase or decrease) activity, 

antibodies, such as an antibody to an altered KChlPl polypeptide, or an 
antibody to a non-altered KChlPl polypeptide, or an antibody to a particular 
splicing variant encoded by a KChlPl nucleic acid as described above; 

peptidomimetics; fusion proteins or prodrugs thereof; ribozymes; other small 
molecules; and 
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other agents that alter (e.g., enhance or inhibit) expression of a KChlPl 
nucleic acid, or that regulate transcription of KChlPl splicing variants (e.g., 
agents that affect which splicing variants are expressed, or that affect the 
amount of each splicing variant that is expressed). 

5 

More than one Type II diabetes therapeutic agent can be used concurrently, if desired. 

A Type II diabetes nucleic acid therapeutic agent that is a nucleic acid is used 
in the treatment of Type II diabetes or in the treatment for a susceptibility to Type II 
diabetes. The term, "treatment" as used herein, refers not only to ameliorating 

10 symptoms associated with the disease or condition, but also preventing or delaying 
the onset of the disease or condition, and also lessening the severity or frequency of 
symptoms of the disease or condition. The therapy is designed to alter (e.g., inhibit or 
enhance), replace or supplement activity of a KChlPl polypeptide in an individual. 
For example, a Type II diabetes therapeutic agent can be administered in order to 

15 upregulate or increase the expression or availability of the KChlPl nucleic acid or of 
specific splicing variants of KChlPl nucleic acid, or, conversely, to downregulate or 
decrease the expression or availability of the KChlPl nucleic acid or specific splicing 
variants of the KChlPl nucleic acid. Upregulation or increasing expression or 
availability of a native KChlPl gene or nucleic acid or of a particular splicing variant 

20 could interfere with or compensate for the expression or activity of a defective gene or 
another splicing variant; downregulation or decreasing expression or availability of a 
native KChlPl gene or of a particular splicing variant could minimize the expression 
or activity of a defective gene or the particular splicing variant and thereby minimize 
the impact of the defective gene or the particular splicing variant. 

25 The Type II diabetes therapeutic agent(s) are administered in a therapeutically 

effective amount (i.e., an amount that is sufficient to treat the disease, such as by 
ameliorating symptoms associated with the disease, preventing or delaying the onset 
of the disease, and/or also lessening the severity or frequency of symptoms of the 
disease). The amount which will be therapeutically effective in the treatment of a 

30 particular individual's disorder or condition will depend on the symptoms and 
severity of the disease, and can be determined by standard clinical techniques. In 
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addition, in vitro or in vivo assays may optionally be employed to help identify 
optimal dosage ranges. The precise dose to be employed in the formulation will also 
depend on the route of administration, and the seriousness of the disease or disorder, 
and should be decided according to the judgment of a practitioner and each patient's 
circumstances. Effective doses may be extrapolated from dose-response curves 
derived from in vitro or animal model test systems. 

In one embodiment, a nucleic acid of the invention (e.g., a nucleic acid 
encoding a KChlPl polypeptide, such as one of SEQ ID NO: 1 or a complement 
thereof); or another nucleic acid that encodes a KChlPl polypeptide or a splicing 
variant, derivative or fragment thereof (e.g., comprising any one or more of SEQ ID 
NO: 1 14-258), can be used, either alone or in a pharmaceutical composition as 
described above. For example, a KChlPl gene or nucleic acid or a cDNA encoding a 
KChlPl polypeptide, either by itself or included within a vector, can be introduced 
into cells (either in vitro or in vivo) such that the cells produce native KChlPl 
polypeptide. If necessary, cells that have been transformed with the gene or cDNA or 
a vector comprising the gene, nucleic acid or cDNA can be introduced (or re- 
introduced) into an individual affected with the disease. Thus, cells which, in nature, 
lack native KChlPl expression and activity, or have altered KChlPl expression and 
activity, or have expression of a disease-associated KChlPl splicing variant, can be 
engineered to express the KChlPl polypeptide or an active fragment of the KChlPl 
polypeptide (or a different variant of the KChlPl polypeptide). In certain 
embodiments, nucleic acids encoding a KChlPl polypeptide, or an active fragment or 
derivative thereof, can be introduced into an expression vector, such as a viral vector, 
and the vector can be introduced into appropriate cells in an animal. Other gene 
transfer systems, including viral and nonviral transfer systems, can be used. 
Alternatively, nonviral gene transfer methods, such as calcium phosphate 
coprecipitation, mechanical techniques (e.g., microinjection); membrane fusion- 
mediated transfer via liposomes; or direct DNA uptake, can also be used. 

Alternatively, in another embodiment of the invention, a nucleic acid of the 
invention; a nucleic acid complementary to a nucleic acid of the invention; or a 
portion of such a nucleic acid (e.g., an oligonucleotide as described below), can be 
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used in "antisense" therapy, in which a nucleic acid (e.g., an oligonucleotide) which 
specifically hybridizes to the mRNA and/or genomic DNA of a Type II diabetes gene 
is administered or generated in situ. The antisense nucleic acid that specifically 
hybridizes to the mRNA and/or DNA inhibits expression of the KChlPl polypeptide, 
e.g., by inhibiting translation and/or transcription. Binding of the antisense nucleic 
acid can be by conventional base pair complementarity, or, for example, in the case of 
binding to DNA duplexes, through specific interaction in the major groove of the 
double helix. 

An antisense construct of the present invention can be delivered, for example, 
as an expression plasmid as described above. When the plasmid is transcribed in the 
cell, it produces RNA that is complementary to a portion of the mRNA and/or DNA 
which encodes the KChlPl polypeptide. Alternatively, the antisense construct can be 
an oligonucleotide probe that is generated ex vivo and introduced into cells; it then 
inhibits expression by hybridizing with the mRNA and/or genomic DNA of the 
polypeptide. In one embodiment, the oligonucleotide probes are modified 
oligonucleotides, which are resistant to endogenous nucleases, e.g., exonucleases 
and/or endonucleases, thereby rendering them stable in vivo. Exemplary nucleic acid 
molecules for use as antisense oligonucleotides are phosphoramidate, phosphothioate 
and methylphosphonate analogs of DNA (see also U.S. Pat. Nos. 5,176,996; 
5,264,564; and 5,256,775). Additionally, general approaches to constructing 
oligomers useful in antisense therapy are also described, for example, by Van der 
Krol etal, (BioTechniques 6:958-976 (1988)); and Stein al t (Cancer Res. 48:2659- 
2668 (1988)). With respect to antisense DNA, oligodeoxyribonucleotides derived 
from the translation initiation site are preferred. 

To perform antisense therapy, oligonucleotides (mRNA, cDNA or DNA) are 
designed that are complementary to mRNA encoding the KChlPl . The antisense 
oligonucleotides bind to KChlPl mRNA transcripts and prevent translation. Absolute 
complementarity, although preferred, is not required. A sequence "complementary" 
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient 
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the 
case of double-stranded antisense nucleic acids, a single strand of the duplex DNA 
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may thus be tested, or triplex formation may be assayed. The ability to hybridize will 
depend on both the degree of complementarity and the length of the antisense nucleic 
acid, as described in detail above. Generally, the longer the hybridizing nucleic acid, 
the more base mismatches with an RNA it may contain and still form a stable duplex 
5 (or triplex, as the case may be). One skilled in the art can ascertain a tolerable degree 
of mismatch by use of standard procedures. 

The oligonucleotides used in antisense therapy can be DNA, RNA, or 
chimeric mixtures or derivatives or modified versions thereof, single-stranded or 
double-stranded. The oligonucleotides can be modified at the base moiety, sugar 

10 moiety, or phosphate backbone, for example, to improve stability of the molecule, 
hybridization, etc. The oligonucleotides can include other appended groups such as 
peptides (e.g. for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al, Proc. Natl. Acad. Sci. USA 
86:6553-6556 (1989); Lemaitre et ai, Proc. Natl. Acad. Sci. USA 84:648-652 (1987); 

15 PCT International Publication NO: WO 88/09810) or the blood-brain barrier (see, 
e.g., PCT International Publication NO: WO 89/10134), or hybridization-triggered 
cleavage agents (see, e.g., Krol et al, BioTechniques 6:958-976 (1988)) or 
intercalating agents. (See, e.g., Zon, Pharm. Res. 5:539-549 (1988)). To this end, the 
oligonucleotide may be conjugated to another molecule (e.g., a peptide, hybridization 

20 triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent). 

The antisense molecules are delivered to cells that express KChlPl in vivo. A 
number of methods can be used for delivering antisense DNA or RNA to cells; e.g., 
antisense molecules can be injected directly into the tissue site, or modified antisense 
molecules, designed to target the desired cells (e.g., antisense linked to peptides or 

25 antibodies that specifically bind receptors or antigens expressed on the target cell 
surface) can be administered systematically. Alternatively, in a preferred 
embodiment, a recombinant DNA construct is utilized in which the antisense 
oligonucleotide is placed under the control of a strong promoter (e.g., pol III or pol 
II). The use of such a construct to transfect target cells in the patient results in the 

30 transcription of sufficient amounts of single stranded RNAs that will form 

complementary base pairs with the endogenous KChlPl transcripts and thereby 
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prevent translation of the KChlPl mRNA. For example, a vector can be introduced in 
vivo such that it is taken up by a cell and directs the transcription of an antisense 
RNA. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can 
be constructed by recombinant DNA technology methods standard in the art and 
described above. For example, a plasmid, cosmid, YAC or viral vector can be used to 
prepare the recombinant DNA construct that can be introduced directly into the tissue 
site. Alternatively, viral vectors can be used which selectively infect the desired 
tissue, in which case administration may be accomplished by another route (e.g., 
systemically). 

Endogenous KChlPl polypeptide expression can also be reduced by 
inactivating or "knocking out" the gene, nucleic acid or its promoter using targeted 
homologous recombination (e.g., see Smithies et al, Nature 317:230-234 (1985); 
Thomas & Capecchi, Cell 51:503-512 (1987); Thompson et al, Cell 5:313-321 
(1989)). For example, an altered, non-functional gene or nucleic acid (or a 
completely unrelated DNA sequence) flanked by DNA homologous to the 
endogenous gene or nucleic acid (either the coding regions or regulatory regions of 
the nucleic acid) can be used, with or without a selectable marker and/or a negative 
selectable marker, to transfect cells that express the gene or nucleic acid in vivo. 
Insertion of the DNA construct, via targeted homologous recombination, results in 
inactivation of the gene or nucleic acid. The recombinant DNA constructs can be 
directly administered or targeted to the required site in vivo using appropriate vectors, 
as described above. Alternatively, expression of non-altered genes or nucleic acids 
can be increased using a similar method: targeted homologous recombination can be 
used to insert a DNA construct comprising a non-altered functional gene or nucleic 
acid, e.g., a nucleic acid comprising one or more of SEQ ID NOs: 1 14-258 or the 
complement thereof, or a portion thereof, in place of an altered KChlPl in the cell, as 
described above. In another embodiment, targeted homologous recombination can be 
used to insert a DNA construct comprising a nucleic acid that encodes a Type II 
diabetes polypeptide variant that differs from that present in the cell. 
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Alternatively, endogenous KChlPl nucleic acid expression can be reduced by 
targeting deoxyribonucleotide sequences complementary to the regulatory region of a 
KChlPl nucleic acid {i.e., the KChlPl promoter and/or enhancers) to form triple 
helical structures that prevent transcription of the KChlPl nucleic acid in target cells 
5 in the body. (See generally, Helene, C, Anticancer Drug Des., 6(6):569-84 (1991); 
Helene, C. et al, Ann. N.Y. Acad. Sci. 660:27-36 (1992); and Maher, L. J., Bioassays 
14(12):807-15 (1992)). Likewise, the antisense constructs described herein, by 
antagonizing the normal biological activity of one of the KChlPl proteins, can be 
used in the manipulation of tissue, e.g., tissue differentiation, both in vivo and for ex 

10 vivo tissue cultures. Furthermore, the anti-sense techniques {e.g., microinjection of 
antisense molecules, or transfection with plasmids whose transcripts are anti-sense 
with regard to a Type II diabetes gene mRNA or gene sequence) can be used to 
investigate the role of KChlPl or the interaction of KChlPl and its binding agents in 
developmental events, as well as the normal cellular function of KChlPl or of the 

15 interaction of KChlPl and its binding agents in adult tissue. Such techniques can be 
utilized in cell culture, but can also be used in the creation of transgenic animals. 

In yet another embodiment of the invention, other Type II diabetes therapeutic 
agents as described herein can also be used in the treatment or prevention of a 
susceptibility to a disease or condition associated with a Type II diabetes gene. The 

20 therapeutic agents can be delivered in a composition, as described above, or by 

themselves. They can be administered systemically, or can be targeted to a particular 
tissue. The therapeutic agents can be produced by a variety of means, including 
chemical synthesis; recombinant production; in vivo production {e.g., a transgenic 
animal, such as U.S. Pat. NO: 4,873,316 to Meade et aL), for example, and can be 

25 isolated using standard means such as those described herein. 

A combination of any of the above methods of treatment {e.g., administration 
of non-altered polypeptide in conjunction with antisense therapy targeting altered 
mRNA of KChlPl; administration of a first splicing variant encoded by a KChlPl 
nucleic acid in conjunction with antisense therapy targeting a second splicing encoded 

30 by a KChlPl nucleic acid) can also be used. 
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The present invention is now illustrated by the following Exemplification, 
which is not intended to be limiting in any way. All references cited herein are 
incorporated by reference in their entirety. 

EXEMPLIFICATION 

The study was done in collaboration with the Icelandic Heart Association, who 
provided an encrypted list of 1350 diabetic patients. In 1967-1991 the Heart 
Association started a study of cardiovascular disease and its complications. 
Measurements of blood sugar were included in a thorough check-up of the 
participants which results led to many individuals being diagnosed with diabetes. The 
list of participants is an unbiased sample of about a third of the Icelandic nation. 
Individuals diagnosed in the years following 1991 were either diagnosed at the 
Icelandic Heart Association or at one of two major hospitals in Reykjavik, Iceland. 

All participants in the Type II diabetes study visited the Icelandic Heart 
Association where each answered a questionnaire, had blood drawn, a blood sugar 
assessment, and measurements taken. Height (m) and weight (kg) were measured to 
calculate the body mass index. In serum, the fasting blood glucose and triglyceride 
levels were measured as well. Diagnoses of Type II diabetes were based on the 
diagnostic criteria set by the World Health Organization (1999). All patients with 
fasting glucose above 7 mM were diagnosed as having Type II diabetes and 
individuals with fasting blood sugar between 6.1 - 6.9 mM were diagnosed with 
impaired fasting glucose. If the participants had no prior history of diabetes, they 
were requested to come in for another test to have their diagnosis confirmed. All 
individuals on diabetic medication were classified as Type II. The questionnaire 
included questions regarding age at diagnosis and type of medication. All patients 
were requested to bring two relatives who's DNA was used to confirm the 
genetotypes of the patients. 

Since the patients had participated in a study that was conducted between 
1967-1991 a considerable time had passed, in some instances, since they had visited 
the Heart Association. Therefore, all the patients were required to have another 
fasting blood glucose test to check on their blood sugar level at the time of 
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participation in the study. Thus, all patients were labeled unconfirmed, meaning that 
results of blood glucose levels were pending, for this particular study. A label of 
confirmed diabetic was given to the patient when the measurements were received. 
Linkage analyses were done with confirmed patients and unconfirmed patients were 
included only if they were close relatives of a confirmed index patient. The initial list 
of patients included 1350 Type II diabetics, but during this study new patients were 
diagnosed who were relatives of the index patients. All participants with no previous 
history of diabetes but with elevated fasting glucose were diagnosed according to the 
WHO criteria as described above. At present date, 1406 Type II diabetics and 266 
patients with impaired fasting glucose have participated in the study, together with 
3972 of their close relatives. 

This study was approved by the Data Protection Commission of Iceland and 
the National Bioethics Committee of Iceland. All patients and their relatives who 
participated in the Study gave informed consents. 

Outline of the study 

This particular genetic study, which has the aim of identifying a genetic 
variant or a gene that may contribute to type II diabetes by using a positional cloning 
approach, can be divided into three steps: 

i. Genome-wide linkage study, where excess allele sharing among related 
type II diabetics is used to identify a chromosomal segment, typically 2 
- 8 Megabases long, that may harbor a disease susceptibility 
gene/genes. 

ii. Locus-wide association study, where a high-density of microsatellite 
markers is typed in a large patient and control cohort. By comparing 
the frequencies of individual alleles or haplotypes between the two 
cohorts, the location of the putative disease gene/genes is narrowed 
down to a few hundred kilobases. 

iii. Candidate gene assessment, where additional microsatellites and/or 
SNPs are typed in all genes that are identified within the smaller 
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candidate region and further association analysis is used to identify 
which of the genes shows strong association to the disease. 

Linkage Analysis 
5 Pedigree Construction 

For the linkage analysis, blood samples were obtained from 964 Type II 
diabetics and 203 individuals with impaired fasting glucose. The patients were 
clustered into families such that each patient is related to (within and including six 
meiotic events) at least one other patient. In this manner, 772 patients fell into 

10 families - 705 Type II diabetics and 67 with impaired fasting glucose. The confirmed 
Type II patients were treated as probands and clustered into families that each 
proband is related to, within and including six meiotic events. The other patients, 
unconfirmed Type II and IFG patients, were added to the families if they were related 
to a proband within and including three meiotic events. The rational behind this was 

15 to include as many patients as possible in the study. Impaired fasting glucose is an 
immediate diagnosis, and we assumed that the more closely related these patients are 
to the confirmed diabetics, the likelier they are to have or to develop the disease. 

The families were checked for relationship errors by comparing the identity-by 
state (IBS) distribution for the set of 906 markers, for each pair of related and 

20 genotyped individuals, to a reference distribution corresponding to the particular 
degree of relatedness. The reference distributions were constructed from a large 
subset of the Icelandic population. Individuals were excluded from the study if their 
relationship with the rest of the family was inconsistent with the relationship specified 
in the geneology database. 

25 The remaining material that was available for the study was the following: 

763 now confirmed Type II patients in 227 families together with 764 genotyped 
relatives. Of the patients, 667 were confirmed Type II patients, 35 unconfirmed Type 
II patients, 52 confirmed patients with impaired fasting glucose (IFG) and 9 
unconfirmed patients with IFG. 
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Stratification of the Patient Material 

The patients were classified into two sub-phenotypes based on their BMI: 
non-obese Type II diabetes are patients who have BMI less than 30, and obese Type II 
diabetes are patients who have BMI at or above 30. The reason for fractionating the 
5 diabetics into non-obese and obese groups is that other factors may be influencing the 
pathogenesis of disease in these two groups. Obesity alone could be contributing to 
the diabetic phenotype. Therefore, this factor was separated. Obesity is most likely 
due to a combination of environmental and genetic factors. This fractionation into 
non-obese and obese diabetics practically separates the material into two halves; 60% 

10 of the patients are in the non-obese category (20% with BMI below 25 (lean) and 40% 
with BMI between 25-30 (overweight)), and 40% of the patients are in the obese 
category (BMI above 30). 

An affected-only linkage analysis for each of those sub-phenotypes was 
performed, using the same set of families as above, but classifying patients not 

15 belonging to the particular sub-group as having an unknown disease status. Restricted 
to a particular sub-phenotype, some families no longer contain a pair of related 
patients classified as affecteds and hence do not contribute in the linkage analysis. 
Such families were excluded from the analysis of the particular sub-phenotype. The 
number of patients and families used in the linkage analysis is summarized in Table 1 

20 below. 

Table 1 : The number of patients and families that contribute to the genome- 
wide linkage scan, both when all the patients are used, and when the analysis is 
restricted to obese or non-obese diabetic patients, respectively. 
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Table 1: Phenotype and Patients 



Phenotype 


Total Number 
of Patients 


NO: of families 
contributing to 
the analysis 


NO: of patients 
contributing to 
the analysis 


All diabetics 


763 


227 


763 


Obese 


296 


92 


219 


Non-obese 


467 


154 


413 



Genome wide scan 

5 A genome wide scan was performed on 772 patients and their relatives. Nine 

patients were excluded due to inheritance errors so the linkage analysis was 
performed with 763 patients and 764 relatives. The procedure was as described in 
Gretarsdottir, et ai 9 Am J Hum Genet., 70(3):593-603 (2002). In short, the DNA was 
genotyped with a framework marker set of 906 microsatellite markers with an average 

10 resolution of 4cM. Alleles were called automatically with the TrueAllele program 
(Cybergenetics, Co., Pittsburgh, PA), and the program DecodeGT (deCODE genetics, 
ehf, Iceland), was used to fractionate according to quality and edit the called 
genotypes (Palsson, B., etal, Genome Res., 9(10):1002-1012 (1999)). The 
population allele frequencies for the markers were constructed from a cohort of more 

15 than 30,000 Icelanders that have participated in genome-wide studies of various 

disease projects at deCODE genetics. Additional markers were genotyped within the 
locus on chromosome 5q, where we observed the strongest linkage signal, to increase 
the information on identity by descent (IBD) sharing within the families. For those 
markers, at least 180 Icelandic controls were genotyped to derive the population allele 

20 frequencies. 

The additional microsatellite markers that were genotyped within the locus 
were either publicly available or designed at deCODE genetics; those markers are 
indicated with a DG designation. Repeats within the DNA sequence were identified 
that allowed us to choose or design primers that were evenly spaced across the locus. 
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The identification of the repeats and location with respect to other markers was based 
on the work of the physical mapping team at deCODE genetics. 

For the markers used in the genomewide scan, the genetic positions were 
taken from the recently published high-resolution genetic map (HRGM), constructed 
5 at deCODE genetics (Kong A., et al, Nat Genet, 31: 241-247 (2002)). The genetic 
position of the additional markers are either taken from the HRGM, when available, 
or by applying the same genetic mapping methods as were used in constructing the 
HRGM map to the family material genotyped for this particular linkage study. 

1 0 Statistical Methods for Linkage Analysis 

The linkage analysis is done using the software Allegro (Gudbjartsson et al, 
Nat Genet 25:12-3, (2000)) that determines the statistical significance of excess 
sharing among related patients by applying non-parametric affected-only allele- 
sharing methods (without any particular disease inheritance model being specified). 

15 Allegro, a linkage program developed at deCODE genetics, calculates LOD scores 
based on multipoint calculations. Our baseline linkage analysis uses the S pa irs scoring 
function (Whittemore, A.S. and Halpern, J., Biometrics 50:1 18-27 (1994); Kruglyak 
L, et ai, Am J Hum Genet 58:1347-63, (1996)), the exponential allele-sharing model 
(Kong, A. and Cox, N.J., Am, J. Hum. Genet, 61:1 179 (1997)), and a family 

20 weighting scheme which is halfway on a log scale between weighting each affected 
pair equally and weighting each family equally. In the analysis, all genotyped 
individuals who are not affected are treated as "unknown". Because of concern with 
small sample behavior, we usually compute corresponding P-values in two different 
ways for comparison. The first P-value is computed based on large sample theory; Z h 

25 = V(2 loge (10) LOD) and is approximately distributed as a standard normal 

distribution under the null hypothesis of no linkage. A second P-value is computed 
by comparing the observed LOD score to its complete data sampling distribution 
under the null hypothesis. When a data set consists of more than a handful of 
families, these two P-values tend to be very similar. 

30 All suggestive loci with LOD scores greater than 2 are followed up with some 

extra markers to increase the information on the IBD-sharing within the families and 
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to decrease the chance that a LOD score represents a false-positive linkage. The 
information measure we use was defined by Nicolae (D. L. Nicolae, Thesis, 
University of Chicago (1999)) and is a part of the Allegro program output. This 
measure is closely related to a classical measure of information as previously 

5 described by Dempster etal (Dempster, A.P., et aL, J. R. Statist Soc. B, 39: 1 
(1977)); the information equals zero if the marker genotypes are completely 
uninformative and equals one if the genotypes determine the exact amount of allele 
sharing by descent among the affected relatives. Using the framework marker set 
with average marker spacing of 4 cM typically results in information content of about 

10 0.7 in the families used in our linkage analysis. Increasing the marker density to one 
marker every centimorgan usually increases the information content above 0.85. 

Results 

The results of the genome- wide linkage analysis with the framework marker 

15 set are shown in FIG. 4 which depicts the allele-sharing LOD-score versus the genetic 
distance from the p-terminus in centimorgan (cM) for each of the 23 chromosomes. 
The analysis was performed with the three phenotypes: all Type II diabetics (solid 
lines), non-obese diabetics (dashed lines) and obese diabetics (dotted lines). A LOD- 
score of 1.84 is observed on chromosome 5q34-q35.2 with the framework marker set 

20 when we use all Type II diabetics in the analysis. When the linkage analysis is 
restricted to non-obese diabetics, this LOD-score increases to 2.81. The obese 
diabetics do not show linkage in this region. 

Additional markers were genotyped in this area to increase the information 
content and to confirm the linkage. The information on the IBD-sharing at this locus 

25 was about 78% with the framework marker set. In order to increase the information 
content, another 38 microsatellite markers were genotyped within a 40 cM region that 
includes the observed signal. Repeating the linkage analysis including the additional 
markers increased the LOD-score to 3.64 (P-value = 3.18xl0' 5 ) for the non-obese 
diabetics. For all patients, the peak LOD-score increased to 2.9 (P-value = 1.22x10^). 

30 This is shown in FIG. 5. 
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The peak of the LOD-score is centered on marker D5S625 and the region 
determined by a drop of one in the LOD is from marker DG5S5 to marker D5S429, 
centromeric and telomeric respectively. The one-LOD-drop is about 9 cM and 
estimated to be about 3.5 Mb. This 1-LOD-drop roughly corresponds to the 80-90% 
5 confidence interval for the location of a putative disease associated gene. 

Locus-wide association study 

Genotyping to Narrow Down the Region of Linkage 

In order to narrow down the region of interest, the linkage analysis is followed 

10 by a comprehensive association study of the 1-LOD-drop. This is necessary as the 
linkage analysis has limited resolution; it compares sharing among closely related 
individuals that share on average large chromosomal segments. For the association 
analysis, we identified a large number of additional microsatellite markers located in 
the 1-LOD-drop and typed those markers in both our patient cohort and in a large 

15 number of unrelated controls randomly selected from the Icelandic population. 

We identified and typed 67 markers in the 1-LOD-drop in addition to the 17 
markers already typed and used in the linkage analysis (locus-wide association 
micorsatellites; Table 6). The new polymorphic repeats (dinucleotide or trinucleotide 
repeats) were identified with the Sputnik program. We subtracted the smaller allele of 

20 CEPH sample 1347-02 (CEPH genomics repository) from the alleles of the 

microsatellites and used it as a reference. A total of 84 markers were available for the 
association analysis, i.e., an average density of one marker every 42kb or one marker 
every 0.107 cM. All those markers were typed for 590 non-obese diabetics and 477 
unrelated controls. 

25 

Statistical Methods for Association and Haplotype Analysis 

For single marker association to the disease, we use Fisher exact test to 
calculate a two-sided P-value for each individual allele. When presenting the results, 
we use allelic frequencies rather than carrier frequencies for microsatellites, SNPs and 
30 haplotypes. Haplotype analyses are performed using a computer program we 
developed at deCODE called NEMO (NEsted MOdels) (Gretarsdottir, et ai, Nat 
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Genet. 2003 Oct;35(2):131-8). We use NEMO both to study marker-marker 
association and to calculate linkage disequilibrium (LD) between markers, and for 
case-control haplotype analysis. With NEMO, haplotype frequencies are estimated by 
maximum likelihood and the differences between patients and controls are tested 
5 using a generalized likelihood ratio test. The maximum likelihood estimates, 

likelihood ratios and P- values are computed with the aid of the EM-algorithm directly 
for the observed data, and hence the loss of information due to the uncertainty with 
phase and missing genotypes is automatically captured by the likelihood ratios, and 
under most situations, large sample theory can be used to reliably determine statistical 

10 significance. The relative risk (RR) of an allele or a haplotype, i.e., the risk of an 
allele compared to all other alleles of the same marker, is calculated assuming the 
multiplicative model (Terwilliger, J.D. & Ott, J. A haplotype-based 'haplotype relative 
risk 1 approach to detecting allelic associations. Hum HeredM, 337-46 (1992) and 
Falk, C.T. & Rubinstein, P. Haplotype relative risks: an easy reliable way to construct 

15 a proper control sample for risk calculations. Ann Hum Genet 51 ( Pt 3), 227-33 
(1987)), together with the population attributable risk (PAR). 

In the haplotype analysis, it may be useful to group haplotypes together and 
test the group as a whole for association to the disease. This is possible to do with 
NEMO. A model is defined by a partition of the set of all possible haplotypes, where 

20 haplotypes in the same group are assumed to confer the same risk while haplotypes in 
different groups can confer different risks. A null hypothesis and an alternative 
hypothesis are said to be nested when the latter corresponds to a finer partition than 
the former. NEMO provides complete flexibility in the partition of the haplotype 
space. In this way, it is possible to test multiple haplotypes jointly for association and 

25 to test if different at-risk haplotypes confer different risk. As a measure of LD, we use 
two standard definitions of LD, D' and R 2 (Lewontin, R., Genetics, 49:49-67 (1964) 
and Hill, W.G. and A. Robertson, Theor. Appl Genet, 22:226-231 (1968)) as they 
provide complementary information on the amount of LD. For the purpose of 
estimating D' and R 2 , the frequencies of all two-marker allele combinations are 

30 estimated using maximum likelihood methods and the deviation from linkage 

disequilibrium is evaluated using a likelihood ratio test. The standard definitions of 
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D' and R 2 are extended to include microsatellites by averaging over the values for all 
possible allele combinations of the two markers weighted by the marginal allele 
probabilities. 

The number of possible haplotypes that can be constructed out of the dense set 
5 of markers genotyped in the 1-LOD-drop is very large and even though the number of 
haplotypes that are actually observed in the patient and control cohort is much 
smaller, testing all those haplotypes for association to the disease is a formidable task 
Note that we do not restrict our analysis to haplotypes constructed from a set of 
consecutive markers, as some markers may be very mutable and might split up an 

10 otherwise well conserved haplotype constructed out of surrounding markers. 

The approach we take to the problem of identifying those haplotypes in the 
candidate region that show strongest association to the disease is two-fold. First, we 
restrict the haplotypes we test to span a sub-region small enough that the included 
markers may be expected to be in substantial LD. In this study, we only consider 

15 haplotypes that span less than 300kb. Second, we apply an iterative procedure that 
gradually builds up the most significant haplotypes. Starting with haplotypes 
constructed out of 3 markers, we select those haplotypes that show strong association 
to the disease, add other nearby markers to those haplotypes and repeat the association 
test. By iterating this procedure, we expect to identify those haplotypes that show 

20 strongest association to the disease. 

Results 

For the association analysis, we genotyped 590 non-obese Icelandic Type II 
diabetes patients and 477 unrelated population controls using a total of 84 

25 microsatellite markers. These markers are distributed evenly across a region of 

approximately 3.5 Mb. The region is centered on our linkage peak and corresponds to 
the 1-LOD-drop. We then applied the procedure described above and looked for 
single-markers and haplotypes consisting of up to 5 markers that showed association 
to the disease. The result is summarized in FIG. 6. In FIG. 6, we show the location of 

30 a marker or a haplotype on the horizontal axis and the corresponding P-value from the 
associaton test on the vertical axis. This is shown for all haplotypes tested that have a 
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P- value less than 0.01 . The horizontal bars indicated the size of the corresponding 
haplotypes and the location of all markers is shown at the bottom of the figure. All 
locations are in Mb and refer to the NCBI Build33. 

We observe a series of correlated haplotypes that show strong association for 
5 non-obese diabetics in two locations within the 1-LOD-drop. We denote those 

regions A (168.37 - 168.83Mb) and B (169.70 - 170.17Mb), and in Table 10 we list 
the most significant haplotype in each of those regions. For each haplotype, the table 
includes a two-sided single-test P-value for association, calculated using NEMO, the 
corresponding relative risk, the estimated frequency of the haplotype in the patient 

10 and the control cohorts, the region the haplotype spans, and the markers and alleles (in 
bold) that define the haplotype. 

Note, however, that some of the haplotypes listed within each of the two 
regions are very correlated and should be considered as a single observation of 
association to the disease. This is demonstrated for region B in Table 3, which lists 

15 the pairwise correlation, both D' and R 2 , between the haplotypes. Based on the 

correlation, we observe that haplotypes B2 and B4 are strongly correlated and should 
be considered as a single observation of association to this region. Likewise, 
haplotypes Bl and B5 are strongly correlated. However, haplotypes Bl, B2 and B3 
are all weakly correlated with each other; and in fact, Bl and B2 are mutually 

20 exclusive, i.e., never appear jointly on the same chromosome. These three haplotypes 
hence constitute three almost independent observations of association to non-obese 
diabetes of this region within the locus. It is possible to test haplotypes Bl, B2 and 
B3 together as a group for association to non-obese diabetes. This test yields a P- 
value = 8.5x1 0" 8 with a corresponding relative risk of 5.2, a population attributable 

25 risk of 1 3.9%, and an allelic frequency of 0.089 and 0.0 1 8 in the patient and the 
control cohorts, respectively. 
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Table 2 
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Table 2: Haplotypes within the 1-LOD-drop that show the strongest association to 
non-obese diabetes. For each haplotype, we show (i) a two-sided P-value for a single 
test of association to non-obese diabetes, (it) the corresponding relative risk (RR), (Hi) 
the estimated allelic frequency of the haplotype in the patient and the control cohort, 
(iv) the span of the haplotype (refering to NCBI 33) and (v) the alleles (in bold) and 
markers that define the haplotype. The haplotypes are separated into two groups, A 
and B, corresponding to two different regions within the 1-LOD-drop. 
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Table 3 
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Table 3: Pairwise correlation between the five haplotypes in the B-region that show the 
strongest association to non-obese diabetes. Estimates of D' are shown in the upper right 
5 corner, and estimates of R 2 are shown the the lower left corner. The haplotypes are labelled 
Bl, ...,B5 as in Table 2. 

Investigation of Region B 
Genes in Region B 

10 We next identified all genes in and around region B (UCSC). In the region 

defined by the five most significant haplotypes, 169.70 - 170.17 Mb, there are four 
genes, LCP2 (lymphocyte cytosolic protein 2), KCNMB1 (potassium large 
conductance calcium-activated channel, subfamily M, beta member 1), KChIPl(Kv 
channel interacting protein 1) and GABRP (gamma-aminobutyric acid (GAB A) A 

15 receptor, pi). Of those genes, KChlPl is by far the largest, stretching from 169.7 to 
170.1 MB, or almost the entire span of the observed haplotype association. The other 
three genes are small. In addition, there is a big gene, RANBP1 7 (RAN binding 
protein 17), just telomeric of the location of the observed association signal. The 
relative location of all the genes is shown in FIG. 7, which shows the location of the 

20 exons of KCHIP1 as solid bars, and the location of the other genes as shaded boxes. 
In addition, FIG. 7 shows the location of the microsatellites (filled boxes) that we 
have typed in this region and the location of the at-risk haplotypes Bl, B5 (gray 
horizontal lines). 
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Description of new Splice Variants of KChlPl Identified by RACE and PCR 

The published sequence for KChlPl comprises exons 1 to 8. New exons 
belonging to the KChlPl gene and four different splice variants were discovered by 
performing RACE or PCR (primers within the exons) using as template human 
5 Marathon cDNA and cDNA prepared from rat pancreatic INS1 beta cells. In all, 6 
new exons located in the 5' region of the gene were discovered.. An alternative exon 

1 was found that we call exon la. Here, we label the published sequence for exon 1 
with a "b" to distinguish it from the alternative exon 1, exon la. Four exons are 
called UTR 1, UTR 2, UTR 3 and UTR 4, or untranslated region 1 - 4, because they 

10 lie upstream of exon lb and they are not translated. The last exon to be identified is 
called Ins-r, or insert rodent, because it was known to be present in mouse and rat, and 
has recently been demonstrated by others to be present in humans as well (Boland et 
al, Am J Physiol Cell Physiol 285, C161-170. (2003)). See nucleotide sequences of 
the new exons below, as well as their location in the genomic sequence of NCBI build 

15 33. Even if not mentioned, all new variants of KChlPl found and described below 
include exons 2 - 8 of the published sequence. 

Splice variant 1 consists of exon la, UTR1, UTR2, UTR3, UTR4 and exon lb. 
Exon la is untranslated and the resulting protein is identical in amino acid sequence to 
KChlPl described by An et al {Nature 430, 553-556 (2000), see also FIG.2). This 

20 variant was observed in human heart and testis and the rat INS 1 cell line. 

Splice variant 2 consists of exon lb and the Ins-r exon giving rise to a protein 
that is identical in amino acid sequence to KChlPl described by Boland et al . This 
variant was observed in human brain, heart, pancreas and the rat INS1 cell line. 

Splice variant 3 consists of exon la and is identical in nucleotide sequence to 

25 AL538404, an EST in NCBI. The amino acid sequence of the N-terminus coded by 
exon la is unique (see sequence below) but the amino acid sequence coded by exons 

2 - 8 is that of the published sequence. This variant was observed in human brain, 
heart, pancreas, skeletal muscle, adipose tissue, liver, hypothalamus, small intestine, 
testis and the rat INS1 cell line. 
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Splice variant 4 consists of exons la and UTR1, which would result in a 
protein translated from exons 2-8. The second methionine in exon 2 has a Kozak 
sequence. This variant was observed in human heart. 

The nucleotide sequences of the new exons are as follows (the genomic 
locations given are from NCBI build 33, see also Table 8): 



Exon 1 a: 1 697 1 6298 - 1 697 1 65 1 1 (Build 33) 

GGCTTCAGGGGTGCATCCGTCACTCAGGGTTCATTCACCCAGGCAGGCTCCAAGT 
TCCTGGGGTGCACAAGGTGGGCACTGTCCCTTCTGGGTGCTGACAGCAGAGCCTG 
GCTCCCCTCCGCCACCATGAGCGGCTGCTCCAAAAGATGCAAGCTTGGGTTCGTG 
AAATTTGCCCAGACCATCTTTAAGCTCATCACTGGGACCCTCAGCAAAG (SEQ ID 
NO: 4) 

UTR 1 : 1 698484 1 7- 1 69848523 (Build 33) 

ACTCAGCATCATCAAGACTGGAGGGACAGAGCATTTGAATCATCAGACGCTGGGC 
CAGACGTCACCCCACGCGTTTTCTCATTTTATC GTCCTAAGAAGCCCAGAAG (SEQ 
ID NO: 5) 

UTR 2: 169861083-169861 154 (Build 33) 

CCTGAATGCAATTTGCAATGAGGAGATGATTTGATTTTCTTCAGCCCTAGACCTCC 
AGCTTCCTGAGAGCAG(SEQ ID NO: 6) 

UTR 3: 169864589-169864679 (Build 33) 

GGGTTCCCCAGGAGACCACGACAGAGGCCTGGAACCCAAGTTCTAATCCCACATC 
CTGGCTGGGCAACTTCAGGCAAATTTCTAACACAAG (SEQ ID NO: 7) 

UTR 4: 169867066-169867173 (Build 33) 

GGTAGGGGAGGGGCCGGGCCCGGGGTCCCAACTCGCACTCAAGTCTTCGCTGCCA 
TGGGGGCCGTCATGGGCACCTTCTCATCTCTGCaAACCAAACAAAGGCGACCC 
(SEQ ID NO: 8) 

Ins-r 170075401-170075433 

ACATCGCCTGGTGGTATTACCAGTATCAGAGAG (SEQ ID NO: 9) 

The nucleotide sequence derived from splice variant 4 (KChlPl .4) with the 
ATG and a Kozak sequence ((G/ANNATGG) underlined is as follows: 



ATAAGATTGAAGATGAGCTGGAGATGACCATGGTTTGCCATCGGCCCGAGGGACT 

GGAGCAGCTCGAGGCCCAGACCAACTTCACCAAGAGGGAGCTGCAGGTCCTTTAT 

CGAGGCTTCAAAAATGAGTGCCCCAGTGGTGTGGTCAACGAAGACACATTCAAGC 

AGATCTATGCTCAGTTTTTCCCTCATGGAGATGCCAGCACGTATGCCCATTACCTC 

TTCAATGCCTTCGACACCACTCAGACAGGCTCCGTGAAGTTCGAGGACTTTGTAAC 

CGCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTT 

AATTTGTATGACATCAACAAGGACGGATACATAAACAAAGAGGAGATGATGGAC 

ATTGTCAAAGCCATCTATGACATGATGGGGAAATACACATATCCTGTGCTCAAAG 

AGGACACTCCAAGGCAGCATGTGGACGTCTTCTTCCAGAAAATGGACAAAAATAA 

AGATGGCATCGTAACTTTAGATGAATTTCTTGAATCATGTCAGGAGGACGACAAC 

ATCATGAGGTCTCTCCAGCTGTTTCAAAATGTCATGTAACTGGTGACACTCAGCCA 

TTCAGCTCTCAGAGACATTGTACTAAACAACCACCTTAACACCCTGATCTGCCCTT 

GTTCTGATTTTACACACCAACTCTTGGGACAGAAACACCTTTTACACTTTGGAAGA 
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ATTCTCTGCTGAAGACTTTCTATGGAACCCAGCATCATGTGGCTCAGTCTCTGATT 
GCCAACTCTTCCYCTTTCTTCTTCTTGAGAGAGA (SEQ ID NO: 10) 



The protein sequences resulting from the splice variants are as follows: 
KChIPl.3 

(The amino acid sequence derived from splice variant 3 (KChIPl.3), the underlined amino 
acids are coded by exon la.) 

MSGCSKJICKLGFVKFAQTIFKLITGTLSKD KJEDELEMTMVCHRPRGT^QT.RAQTN^ 
KRELQVLYRGFKJsIECPSGVVNEDTFKQIYAQFFPHGDASTYAHYLFNAFDTTQTGSV 
KFEDF\aALSILLRGTVHEKXRWTFNLYDI^DGYINKEEMMDIVKAIYDMMGKYTY 
PVLKEDTPRQHVDVFFQKMDKNKDGIVTLDEFLESCQEDDNIMRSLQLFQNVM (SEQ 
ID NO: 11) 

KChIPl.2 

(The amino acid sequence derived from splice variant 2 (KChlPl. 2), the underlined amino 
acids are coded by exon Ins-r.) 

MGAVMGTFSSLOTKQRRPSK JDIAWWYYOYOR DKIEDELEMTMVCHRPFrir FQT .R A 
QTNFTKRELQVLYRGFKNECPSGVWEDTFKQIYAQFFPHGDASTYAHYLFNAFDTT 
QTGSVKFEDFWALSILLRGTVHEKLRWTFNLYDINKJDGYINKEEMMDIVKAIYDMM 
GKYTYPVLKEDTPRQHVDVFFQKMDKNKDGIVTLDEFLESCQEDDNIMRSLQLFQNV 
M (SEQ ID NO: 12) 

KChIP1.4 

(The amino acid sequence derived from splice variant 4 (KChlPl. 4).) 

MVCHRPEGLEQLEAQTNFTKRELQVLYRGFKNECPSGVVNEDTFKQIYAQFFPHGDA 
STYAHYLFNAFDTTQTGSVKFEDFVTALSILLRGTVHEKLRWTFNLYDINKDGYINKE 
EMMDIVKLAIYDMMGKYTYP\^KEDTPRQHVDVFFQKMDKNKI)GIVTLDEFLESCQE 
DDNIMRSLQLFQN VM (SEQ ID NO: 1 3) 



Identification of SNPs and Microsatellites 

In order to identify SNPs across KChlPl, all exons of KChlPl and their 
flanking regions were sequenced on 94 non-obese diabetic patients. As a 
consequence, 31 SNPs were identified (Table 9). Additional SNPs were identified 
across the gene by selecting SNPs from the public domain (US National Center for 
Biotechnology Information's SNP database) and designing SNP assays for them. 
(Table 10). 

We genotyped SNPs on 470 non-obese diabetics and 658 population-based 
controls using a method for detecting SNPs with fluorescent polarization template- 
directed dye-terminator incorporation (SNP-FP-TDI assay) (Chen, X., Zehnbauer, B., 
Gnirke, A. & Kwok, P.Y. Proa Natl. Acad. Sci. USA 94, 10756-10761 (1997)). 
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Association Study of Genes in Region B 

We tested all the genes in and around Region B {LCP2, KCNMB1, KChlPl, 
GABRP and RANBP1 7) individually for association to non-obese diabetes. In the 
analysis of each gene, we included all SNPs identified, and previously typed 
5 microsatellites, in and close to that gene. The association analysis was carried out in 
the same way as the locus-wide association, i.e., using the iterative approach, we 
search for haplotypes, shorter than 300kb, that showed strongest association to the 
disease. 

The strongest association observed was for KChIPL For KChlPl, we tested 
10 25 markers, 7 microsatellites and 18 SNPs, for association (Table 1 1). The strongest 
association signal was observed in the 3 '-end of the gene; a three marker haplotype 
with a P-value = 9.2xl0~ 5 , relative risk 12, and allelic frequency 3.6% and 0.3% in the 
patient and control cohorts, respectively. This haplotype, which extends over the last 
8 exons of KChlPl, from 169.96 to 170.1 1 Mb, is listed in Table 4 as Dl. We also 
15 observed another haplotype in the same region that showed association to non-obese 
diabetes, albeit less significant than Dl, with a P-value = 0.037, relative risk 1.69 and 
allelic frequency 7.8% and 4.8% in the patient and the control cohorts, respectively. 
This haplotype is labelled D2 in Table 4. For risk haplotypes, the corresponding 
population attributable risk is PAR = 4.9% for Dl and PAR = 4.7% for D2. However, 
20 as Dl and D2 are independent haplotypes, i.e., they do not appear jointly on the same 
chromosome, their population attributable risk can be added together. 



Table 4 





P-Value 


RR 


Aff.frq. 


CtrLfrq 


Haplotype 


Icelandic 












Dl 










-4 DG5S13 C KCP 1152 


9.20E-05 


12 


0.036 


0.003 


0 D5S625 


D2 










0 DG5S124 C KCP 1152C 












KCP 2649 T KCP 4976 A 




0.037 


1.69 


0.078 


0.048 


KCP 16152 


Danish 












Dl 










-4 DG5S13 C KCP 1152 0 


0.052* 


2.98 


0.031 


0.011 


D5S625 


D2 










0 DG5S124 C KCP 1152 C 












KCP 2649 T KCP 4976 A 




0.002* 


2.74 


0.098 


0.038 


KCP 16152 



* One-sided P-value 
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Table4: Microsatellite and SNP haplotype association within KChIPL The two 
independent haplotypes D 1 and D2 are located in the 3'-end of the gene, from 1 69.96 - 1 70. 1 1 
Mb. Shown are results of a test of association for non -obese diabetics vs population controls 
for both haplotypes in a cohort of Icelandic diabetics (top) and a replication in a cohort of 
5 Danish diabetics (bottom). Note that we report one-sided P-values for the test on the Danish 
cohort as that is a replication of association results previously observed in the Icelandic 
cohort. 

Replication in a Cohort of Danish Diabetics 

10 We typed the markers that define the two at-risk haplotypes, Dl and D2, in a 

cohort of 149 non-obese Danish females that have been diagnosed with diabetes 
and/or measured >7mM glucose who participated in a Danish PERF (Prospective 
Epidemiological Risk Factors) study. As controls, we used 346 females from the 
same study that answered no to a question about their diabetes status and/or measured 

15 <7mM glucose. 

The results of the association test for the two at-risk haplotypes, identified in 
the Icelandic diabetes cohort, are listed in Table 4. Both haplotypes appear in higher 
frequency in the non-obese Danish diabetics than in the control cohort. For haplotype 
Dl, the association to non-obese diabetes is only marginally significant, with a one- 

20 sided P-value = 0.05, and the relative risk of the at-risk haplotype is RR = 3.0, 

somewhat less than is observed for the Icelandic non-obese diabetics. Note, however, 
that the estimated frequency of haplotype Dl is very low, especially in the control 
cohorts, hence the estimates of the relative risk are not very reliable. For haplotype 
D2, on the other hand, we do observe a statistically significant association with a one- 

25 sided P-value = 0.002 and relative risk = 2.74. Note that as the test of association of 
haplotypes Dl and D2 are attempts to replicate the association we have observed for 
Icelandic non-obese diabetics, it is appropriate to report one-sided P-values for those 
tests. 

30 Additional SNP Genotypingfor KChlPl 

Having observed association to the 3 '-end of KChlPl, both in Icelandic and 
Danish non-obese diabetics, we subsequently sequenced 94 Icelandic individuals, 1/3 
non-obese type II diabetes patients with the observed haplotype Dl, 1/3 additional 
non-obese type II diabetes patients and 1/3 controls. The purpose of the sequencing 
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was to identify additional SNPs. We identified 725 SNPs (Table 12). Many of those 
SNPs were completely correlated so we removed several redundant SNPs from 
further genotyping. Some SNPs with very low minor allele frequencies were also 
ignored. Of the 725 identified SNPs plus what was originally identified, 108 were 
5 selected for further genotyping in the Icelandic cohort (Table 1 3). 

A single-marker test of association was performed on non-obese diabetes for 
each of the additional SNPs we typed, although none of the SNPs showed a strong 
association. We did, however, observe that three of the SNPs, KCP_1 97678, 
KCPJ97775 and KCP_202795, increased the specificity of haplotype D2, if added to 

10 that haplotype, while still retaining most of its sensitivity. This is shown in Table 5, 
both for the association in the Icelandic and in the Danish cohorts. This increases the 
value of the at-risk haplotype as a diagnostic tool. Note that the three SNPs are very 
correlated to each other, with pairwise correlation coefficients D' » 0.96 and R 2 » 0.9, 
hence the association of haplotypes D3, D4 and D5 to non-obese diabetes should be 

15 considered as a single observation. 

In addition to the refinement of the at-risk haplotype D2, we observed another 
refinement of the at-risk haplotype, consisting of three SNPs only, that was very 
correlated with the three at-risk haplotoypes, D3, D4 and D5, with pairwise 
correlation coefficients D' « 0.83 and R 2 s 0.59, This haplotype is included in Table 

20 5 as D6. 



25 



30 
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Table 5 





P- 


RR 


PAR 


Aff.frq. 


CtrLf 


Haplotype 




Value 








rq 




Icelandic 














D2 












0DG5S124CKCP 1152CKCP 2649 T 




0.037 


1.69 


6.3% 


0.078 


0.048 


KCP 4976 A KCP 16152 


D3 












0 DG5S124 C KCP 1 152 C KCP 2649 T 




0.022 


2.19 


5.5% 


0.052 


0.024 


KCP 4976 A KCP 16152 T KCP~ 197678 


D4 












0 DG5S124 C KCP^ 1 152 C KCP ^649 T 




0.052 


2.03 


4.6% 


0.046 


0.023 


KCP 4976 A KCP 16152 T KCP~ 197775 


D5 












0 DG5S124 C KCP 1 1 52 C KCP ~2649 T 




0.023 


2.14 


5.5% 


0.052 


0.025 


KCP 4976 A KCP 16152 C KCP 202795 


D6 












A KCP 173982 C KCP 15400 C 




0.054 


1.77 


4.0% 


0.046 


0.027 


KCP 18069 


Danish 














D2 












0DG5S124 C KCP 1 152 C KCP 2649 T 




0.002* 


2.74 


12.0% 


0.098 


0.038 


KCP 4976 A KCP 16152 


D3 


0.0046 










0DG5S124 CKCP 1 152 C KCP 2649 T 




* 


2.60 


9.0% 


0.076 


0.030 


KCP 4976 A KCP 16152 T KCP 197678 


D4 


0.0004 










0 DG5S124 C KCP 1 152 C KCP 2649 T i 




* 


3.69 


11.3% 


0.078 


0.023 


KCP 4976 A KCP 16152 T KCP 197775 


D5 


0.0002 










0 DG5S124 C KCP 1152 CKCP 2649 T 


* r\ i r> 


* 


3.67 


11.7% 


0.084 


0.024 


KCP 4976 A KCP 16152 CKCP 202795 



* One-sided P-value 



Table 5: Microsatellite and SNP haplotype association within KChIPL Shown is association 
of the at-risk haplotype D2, and of further refinements of that haplotype; haplotypes D3, D4 
5 and D5, to non-obese diabetes. This is shown both for the Icelandic and the Danish cohorts 
and, as in Table 4, we report one-sided P-values for the association test in the Danish cohort. 
Finally, we include the result of association to non-obese diabetes, in the Icelandic cohort, of 
a 3 SNP haplotype, D6, that is strongly correlated with the at-risk haplotoypes D3, D4 and 



Allele Numbering System 

SNP alleles are indicated by the letters found in the DNA sequence. In 
general the alleles can be references by A=0, C=l, G=2 and T=3. For microsatellite 
alleles, the CEPH sample (Centre d'Etudes du Polymorphisme Humain, genomics 
15 repository) is used as a reference, the lower allele of each microsatellite in this sample 
is set at 0 and all other alleles in other samples are numbered according in relation to 
this reference. Thus allele 1 is 1 bp longer than the lower allele in the CEPH sample, 
allele 2 is 2 bp longer than the lower allele in the CEPH sample, allele 3 is 3 bp longer 
than the lower allele in the CEPH sample, allele 4 is 4 bp longer than the lower allele 
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in the CEPH sample, allele -1 is 1 bp shorter than the lower allele in the CEPH 
sample, allele -2 is 2 bp shorter than the lower allele in the CEPH sample, and so on. 

Table 6: 

5 The DNA sequence of the microsatellites employed for the COS locus wide 
association (including Build 33 locations). 

Y= C or T; S = C or G; R = A or G; W = A or T; M = A or C; K = G or T. 



TABLE 6 



Name 


Position 


Nucleic Acid Sequence 


SEQ ID 

NO: 


DG5S5 


167638990- 
167639163 


TCCTCAGAACAGGTGCAACACAGTGTGTTTTGCTGGGG 
AAAAGGGATGTCAAGCAATCTATGACGGGGGTGCAGG 
GAGTCTGGGGAGAAACACAAGGAAGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGAATGTGTGTGTGTGTGAGAG 
AGAGAGCTGGTGTTTGTGTTCCA 


SEQ ID 
NO: 14 


D5S671 


167657904- 
167658237 


GGAATGTGCCAAGACATTCTTTAGGGTTGGTAACCAG 

AGACGCTATTTTGTCCTTGGTGGCTAAGAAATCACTTT 

TCTGACTGAAGGNCCATTTGACTTACTTCTTTTAAATT 

CAGGGGAATGGGTGGGCATCTCCATGATTCAGGTAAG 

GAAAAATCCAAGGNAAATAAACACACACACACACAC 

ACACACACACACACACACACGGAGTAGAAATTTTTAG 

TGCAATTTTTTGTCTCACAGCATTAATTAATTGCAGGG 

ATATAACTACCTTGGCAGAATTTTTTCTCCCCAACCCA 

CCACCCCCCGGAATAAGTTTGGCTCTTTTCAGCT 


SEQ ID 

NO: 15 


DG5S870 


167719773- 
167719939 


TGCCCACTCATAAGATGCTGAGGTTACAACTGTTAATA 

AGATATTAAGATACTGTCTTTTTCTTCCTCTTTCTCTCT 

TACACACACACACACACACACACACACACACTTTTTG 

GGCCAACTGGAAATTCATACATTCTCCCCAGCACTGGA 

GCTCAAAGCGTCTG 


SEQ ID 
NO: 16 


D5S671 


167657904- 
167658237 


GGAATGTGCCAAGACATTCTTTAGGGTTGGTAACCAG 

AGACGCTATTTTGTCCTTGGTGGCTAAGAAATCACTTT 

TCTGACTGAAGGNCCATTTGACTTACTTCTTTTAAATT 

CAGGGGAATGGGTGGGCATCTCCATGATTCAGGTAAG 

GAAAAATCCAAGGNAAATAAACACACACACACACAC 

ACACACACACACACACACACGGAGTAGAAATTTTTAG 

TGCAATnTnGTCTCACAGCATTAATTAATTGCAGGG 

ATATAACTACCTTGGCAGAATTTTTTCTCCCCAACCCA 

CCACCCCCCGGAATAAGTTTGGCTCTTTTCAGCT 


SEQ ID 
NO: 17 


DG5S870 


167719773- 
167719939 


TGCCCACTCATAAGATGCTGAGGTTACAACTGTTAATA 

AGATATTAAGATACTGTCTTTTTCTTCCTCTTTCTCTCT 

TACACACACACACACACACACACACACACACTTTTTG 

GGCCAACTGGAAATTCATACATTCTCCCCAGCACTGGA 

GCTCAAAGCGTCTG 


SEQ ID 
NO: 18 
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DG5S85 


167721558- 
167721918 


TTGTTGTTGTTGGTGGTGGTGGGGTGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTTCGAGACAGACTCTC 

ACTCTGTCACCCAGGCTGGAGTGCAGTGGCACGATCT 

GGGTTCACTGCAACCTCTACTTCCTCAGCTCCAAGGAT 

CCTCTCACCTCCACCTCCCAAGTAGCTGGGACTACAGG 

TACGCGCCACCATGTCTGGCTAA'rrrrrn'GTATTGGA 

GAGACAGGGTTCCACCATGTTGCCCGGGCTAGTGTTGC 

ACTCCTGAGCTCAGGTGATCCACCCACCTCAACGTCCC 

CAAGTGCTGGGATTAGAGGCGTGAGCCACCACGTCTG 

GCCTATACACTATAGAGTTT 


SEQID 
NO: 19 


DG5S90 


167766290- 
167766502 


TCTGGACAGGACCAGGAGTTGGCTGCTGTCAGCCTTTG 

CCCCACCTCTCTGTGGCTACTGGGTATGTGAATCTCTC 

AAGGCCTGAAGAGAGGACAGCTGAGGAATTTGGAAAT 

CCTAAAACACATGCATACACACACACACACACACACA 

CACACACACACACACACACTTTTCTTTCCCTTAAAAAA 

AAAAAGATTCATTCACCGTGTGCA 


SEQID 

NO: 20 . 


DG5S874 


167846718- 
167847065 


CTGTCTACACTACCCACCCATTAGTCACTTATTAGCCC 

TCTGAATTACTGGATTGAAAAAACATAGTATATATATA 

GGGCTTGGTACTATTCACGGTTTCAGGCATCCACTGAG 

GGGTGTTGCAATGTATCTCCCACGGATAAGGAAGGAC 

TGGTATATTAACACTTTTATTTGATTTACAAAATAAAG 

GATAGTTTATATAGTTCTGGGTAAAATTAATTAATTAA 

TTTAAAAGGAAAAAAGATAAAGGCAAACTTTAAGCTT 

GTTAAAAATTAAGTAAAATAATTTGGATTATTTAATTG 

GACAAAGAGGACTGGCTTTGCCAATGAAACAATATGG 

CCGACATG 


SEQ ID 
NO: 21 


DG5S88 


167864864- 
167865059 


GGACCTTCTTTCTGCCCTAAAACCGCAATATCATTATA 

ATAACAAATATATATATATATATATATATATTTTTTTTT 

TTTAAAACAATCTTGCTATGTTGCCTAGGCTGGTGTGG 

AACTCCTGGCCTCAAGTGATCCTCCCACCTCGGCCTCC 

CAGAGTGCTGGGATTATAGACATGAACTACCATACCC 

AGCCA 


SEQ ID 

NO: 22 


DG5S7 


167910343- 
167910651 


CACAGCCATCAAGTTTCCAACTTACTGCCTCACATATT 

AAGATGATTrrrriAAACAAACTTAACAGGCGATGGAT 

ACTCCATTCTCCATGATGTGCTTAATTCACATGCATGC 

TTGTATCAAAACATCTCACATACTCCATAAAGCCTGTA 

ATCCCAACACTTTGGGATGCCAAGGTGGGTGGATCAC 

TTGAGCCCAGGAGTTTGAGAACAGCCTGGACAACATG 

GCGAAACCCCATTTACACACACACACACACACACACA 

CACACACACCACACAAACAAAATGAAACAAACACCTA 

ACCAACAA 


SEQ ID 

NO: 23 


DG5S6 


167952553- 
167952858 


TCCTAACGGCTGCTACCACTAAAGATCTTAGCATGGTG 

TGTGTGTGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

TGTGTGGTGGGGCTATTCAGTAAGGCTAGAAGTGAAA 

AAGCTAGTAGAAAGCCCATGGTGATGGAGAATGGAGG 

AAGACTGATTAGGGAGCTCCTCAGCAGTATAAGGAAG 

GACTAAGAGCACATAAGGACAGGATCATAGAATTCCG 

CATCTCAGGATTTTTGAGGCTGCCACTGCCTTAGCTGT 

GAGGCCAGTGCATATAAGAATAGTTTGCACAGTTCTG 

CTGTGG 


SEQ ID 
NO: 24 


DG5S87 


167992779- 
167993149 


CCTCTGGGATTAGCCTCTCAGGGTACAGATATAACGAT 

GATTGAGTTGGCTTATGTATGTGTGTGTTGTGTGTGTG 

TCTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGA 

GAGAGAGTGACAGAGAGAGAATGAGAGAGAACTGGA 

AGTTGTCAACAAGAAGAGTCAAACTCTGTAAAATATT 

TGAAGAGATTTATTCTGAGCCAAATAGGAGTGCCACA 


SEQ ID 

NO: 25 
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GCCCCGGGAGATCCTAAGAACATGTGCCCAGAGTAGT 
CAAGCTATAGTTTGGTTTTATACATTTTAGGGAGACAT 
AAGACATCAGTCAATACATGTAAGATGCACATTGATA 
CACTGGTTTAGTAGGGAAAGGTGGGACAACTCGAA 




DG5S91 


168014827- 
168015078 


GGTGCCAATTAAATCCAACAAGGTAGCTGAGTGTGGT 

GGTGCACGCCTGTAGTCCTAGCTATGCAGGAGGCTGA 

GGTGGGAGGATCACTTGAGCCTGGGAGGTCGAGGCTG 

CTGTGAGCTGTGATTGCACCGCTGCATTCCAGCCTGGG 

AGACAGAGCAAGATCCTGACACACACACACACACACA 

CACACACACACACACACACACACACATTCCAACAAGG 

TAATGTGTAGGAGGAAGTACCCGAGCTT 


SEQ ID 
NO: 26 


DG5S92 


168065529- 
168065864 


CAACTCCTGCAGCCCTTTACGCCAAGCACAGAAATCC 

AGGAGGCAGAGCCTAGCGCTTGATGACATGGTAATTG 

GGCCTGGAAGTGGGGATTTCTGTCACTTACCTCTCCTT 

GAAAAATAATCACTATTGCCAACGCCTGGTTAATTAGC 

CTGATTCAATTCTCITCAGCCTCATTTTGCTCAAATCTA 

CCAGATTTGTGGTGCTCCTTGGTCCTCCACCACACTTT 

CTACCCCTCATCCCACTTTGTGTGTGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGACA 

TGGCCAGGAATCCTGACTGGCTTCCTTTAAA 


SEQ ID 

NO: 27 


DG5S491 


168081175- 
168081342 


AAGACCACCCTCCTGTTGTGCTCTCCTGAAATGTATTC 

ATATCCACCCATACACACACACACACACACACACACA 

CACACACACACACACACACACACACACATTCTCTCTCT 

CTCTCTCTTTCTCTCTCTCTCTTAAATGTCAGTTTTCTCT 

TCCTGCTTTCCAGA 


SEQ ID 

NO: 28 


DG5S9 


168139425- 
168139680 


CTTGACATTCAGGGCCTTCTGAGTACATCATCTTGTCA 

AGAAACACTGAACTATTCAGTACACAACAGGTCAGAG 

GTGCCCATTTGATAGCCTGAGGATGGAATCCTTATTGC 

AGCATTTTGCGTCATGCCACATATATGTGTTTTTTCAAT 

CCTCCTCTGTTTTAAAAATTGGAAAATTTCATACAACA 

CACACACACACACACACACACACACACACACACACAC 

ACCCCCCATACCACACCACACCACATCA 


SEQ ID 

NO: 29 


DG5S876 


168266982- 
168267134 


AGCCTCTGACTCTCCTCTGTGGGGCTAATCCAGAAAAT 
CTTACTTTAGAAATAACAATAATAATAATAATAATAAT 
AATAATACCTCATTCATCTTTACTTATCATGTGCTAGT 
ATGTTTCTAAGCCTTTTGGCATAGCCTTCAATGTCCCT 


SEQ ID 

NO: 30 


DG5S97 


168286866- 
168287096 


CTCTTCCCATGTCCTGCTTCTCTCCTTCCTGCGGGTTGG 

GACCCAGTACCCTCCATCTCTCTCACTCCTCCCCTCTCA 

AACCCTTCTTTTAGGAAAGGAGTCCAAATCGACCACTT 

ACACCTCAGTTCAATGCAAGCCAGTATAATTAATAAG 

GAACATTTAAGGGTGTGTAAGGGTGTGTGTGTGTGTAT 

GTGTGTGTGTGTGTGTGTGTAGCTCACTCTGCCTCTGC 

C 


SEQ ID 
NO: 31 


D5S2052 


168324273- 
168324633 


GATCACCAGGGAATCTAGATGGAATCCATAGTNCTNC 

CCTGCAAGAATGCTGCAATTCTGTACCGTGGAGGNGC 

CAACAGAATCACCAGGCTCTGTGACTCAGTCACAACA 

CCCTGACCTGCCCCTGTCCATTCTCCATATCATACCCA 

GAGTGGTCTTTTCAAAGCACAGCTTTGACCAATTCTCT 

GTCTTTCACACATACACACACACACACACACACACAC 

ACATGCGTGCATGCATGCCTGAAATAGTATAGTATTGC 

TCTTAAGATAAACATTAANGTTCCTACCATGGTACAGA 

AAATATATGTNGTTAGGCCCCGTGGCTCTTTCTTTTCC 

AGACTCCTCTTACCCTTTTGTG 


SEQ ID 

NO: 32 


DG5S879 


168369069- 
168369202 


AAATCTTCCATTGCAGACCAATTAAAATTTAAAGATTT 
TCTCTCTCTTTCCCCCTCTTCTCTCTCTCTCTCTCTCTCT 
CTCTCTCACACACACACACACACACACACACAACTCTC 


SEQ ID 

NO: 33 
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CAAGCACAAAGAGCTGA 




DG5S880 


CHR5: 

168376530- 

168376775 


TGAGTGGATGAGGGAGAAGGATAAAAGTCATAAAATG 

CCTCGCAAAAACTTTGAGGTCTGCCTGCCTGGTATTAC 

AGAGAAGCTTGCACAATACAGAATGTTTTGTGGGAAG 

GAAGGCAGGCAGGCAGGCAGGCAGGCAGGCAGGTTG 

GTTATGTTTTCACTCTTGATATCTCAAAGCTTTATGACA 

CACTCATGGAGTGAACATAATCTTTGTGGCATGATACA 

AAGGGACTGAATCACTCAAG 


SEQ ID 
NO: 34 


D5S400 


168378412- 
168378696 


AGCTATGATCATGCCACTGCACTCCAGCCTGGCTGATA 

GAATGAGATTCTGACTCAGAAAAATATAAACACACAC 

ACACACACACACACACACACACACACACACACACACA 

CACAACTTTCTGTCTGCCCCCTTGCTCTTCCTGTCCCAT 

CTCTGCCTITCTrCTTTCCTCTCITTGTCAAATCT 

GTCTGCCTCACAAAGGCCAGTGAGCCCCAGCCGCAGA 

CCAGGGAAGCCAGCAAATTAGGAATTTTCTTCACAAA 

GTTTTGAGTAGCT 


SEQ ID 

NO: 35 


DG5S881 


168395631 - 
168395815 


TCTCCCATCTCCTCCCTAGCTATCCCCTGCCTGGTAATC 

ACTGTTTTGTTCTCTATTTCTGTACAGTTGGGTTTTTGT 

TTGTTTGTTTGTTTTTAGATTCCACATATACTTGAAACA 

ATGCAATCTTTTTCTTTCTTTGTCTGGTTTATTTCACTT 

AACACAGTGCACTCCAGGCTCGTCTATG 


SEQ ID 

NO: 36 


D5S2043 


168499686- 
168499956 


CCAGAAAGCTTCACCAAGGGGCGAGCATGATTATGCA 

TGAGCTTCTTAAATCTGGACTTCCCGACAGCTTCTCAT 

GACAGGTCTTCTGTGGAAGACTCCTTAGATTCAGACCA 

TCAGGCCTTTNAAAAGCACAGGAACCTACTTTACCTCG 

CCCAACTCTACGGATGGGATAGGNACTTACAAGGACA 

TTTCCTCATTGGATTCCAATGTTCATTCTCCCCTTCTCT 

CTCTCAATTAATCTCCCCCTCTTCTCTTTCTATCTACAC 

ACACACACACACACACACACACAGAGAGAGAGAGAG 

AGAGAGAGAGAGAGAGANAGAAACAGCTTCTTCACA 

GCGGGAAGCAGGGGAAGGGTATCTATTTCCGGCAAGA 

TC 


SEQ ID 

NO: 37 


DG5S1058 


168554788- 
168555167 


AAAGTCAGCTAGGGCGACTGAGCCAGAGAGATGGGGC 

ACACAGCAAGAGGAGACCTGACAAAGTGCAGGTGTGT 

CCTAGAGAAGGCAAGCGAGACCACTGCATGATTGGAA 

TACCAGCCAACCCTTGCCTGTTCTTGTTCCAGCAAAGT 

GCCCTTTTAAAATAAATTTATGTATATAGTCTCTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTATAGACATAT 

AGAAATATATATTCCTAATTCAGAACTCATTCGTAAGT 

GCACACACTGACATGTGTTTCATGTTTCCCAATTTATC 

CCAGAGCCTATATGCAGTGTTTGGCTGCACAAGTAGG 

CATTAAATGCAACCACTGGGAATGAGAATGGTGGCCA 

CAAC 


SEQ ID 

NO: 38 


D5S2075 


168569742- 
168570114 


AGCTTTGAAATCCCATCCAAACTNATTGGCGTTTCAAA 

CTGAATCCCAGATGTTCACTTACTGAGAAATAAATGA 

ATGGCCCAATTCGTGGACTGANGCAGGGNCCCTCACA 

AATAGATTCCAGGTGTGTTGGCCTCTGGACCACTATCT 

TTCTCTGTTTTACATACACATACATACACACACACACA 

CACACACACACACACACACACACACACACACGGCACC 

AAGTCCATCCTGAAAAGAATTCAACGTCATCTCCAAGT 

TAGAGCCAGTNTAGGATGAACAGAGGTAGTTACCTAA 

CACAAATAACATATTTTCAATTGTGGATGAAGGCAAA 

GGGCTCCACATTCACACTCTTGTGCCTTCAATA 


SEQ ID 

NO: 39 


DG5S883 


168592111 - 
168592265 


GGCATGGGATGTTTGACCTAAGAGATGGACTAAAGCC 
AATGAGTAAAAATGTAAAGCGTACTTAGTCAAATAAA 
TTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCCTTATTCA 


SEQ ID 
NO: 40 
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TCACTCTTGGGCCGTGATGATGATGAGGGAGAGGAGC 
AGT 




DG5S38 


168715977- 
168716367 


TATTTGCCTGCCTGGGTTAGATGATTCTCCAGGCTTCT 

ACACAATTTTATGTTTATATGAAAATAGCCACAAAGG 

GAAAAGAGGACAATAAAACAAGAGATATGAATAATA 

ATGTATTGTATACTTGAAATTTGCTAAAAGAGTAGATC 

TCAAGTGTTCTACATACACACACACACACACACACAC 

ACACAAAGGTAATGAATGAGATGATAGGTGTTAATTA 

ACTTGATTGTGGTAATCACTTCACAATGTATACATATA 

AAAACATCATGTTTTACAACCTACATTTATACAATTCC 

TCAATTATATATCAATAAACCTGGAAAAATAAAGATG 

TATAAAAAAGATTTACAAATAAGATTTTTAAAAAAGG 

ATTGTGAGGAAACAAAG 


SEQID 
NO: 41 


DG5S37 


168770226- 
168770418 


ACCAGCTAACCTGCCATGAGACTGTTGTGTAGCCATCT 

TCACCTCCTCATCTTCAGGGAAGGGGATGAAAATATCT 

GTGCACTGCAAATGTTAACTATATATACACACACACAC 

ACACACACACACACGTACAGTAGGCCCTCCATAACCT 

GAGGTTCCACATCTGCATATTTTACCAACTCTGGTCCC 

TGC 


SEQID 
NO: 42 


DG5S886 


168803195- 
168803445 


TTGTTCCTGAATGGGAGGAGGACTGGTGAGTGA GGGG 

GAAAGAATGGAGACAGGACTGAGAAGAACCAGAAAT 

TAAAATAATAGTAGTAATAGCCTAACATGTACACGTA 

TATGAGATCTATCTATCTATCTATCTATCTATCTATCTA 

TCTATCTATCTATCATCTATATATCTATCATCCATCATG 

TATCTATCTATTTGCATATATAAGCTATAATATCTGGC 

TCTGTTCTAATTGTTT 


SEQID 
NO: 43 


DG5S101 


168833451 - 
168833700 


CCAGGCTTGGATGAGAGAATAATCTTAAGGAAGTCAG 

CATATGTTCTAGAAACATTCAGAAGACAAAAGAGTCT 

GTTATGAAAGAACAAAGTATTTGTAATAATAAATTGA 

ATGTTACATGGACACACCCAGACATACACACACAGAC 

ACACACACACACAGTTTTTCTTCTCTCTCTCTCTCTCCC 

CACTCCCCTCTCTCATACTTTGCAAACAAGCTCCTCAG 

CAGCTGGTAAGCTGTTCCCTGTCC 


SEQID 
NO: 44 


DG5S102 


168895047- 
168895352 


TCCTGACTGCTCAAAGCTCAAGGTGTTGCCTTTTTCAA 

ATGGGATGCAATAGCCTACrCATTTTCCAAGATTAAAG 

CTAGAGAGAAGAATGAATGAATGAATAAATAAATAAA 

TAAATAAATAAATAAATAAATAAATGAGCAAAGTTAA 

TATTAGCTGGAAAAAATAGGGTACAGGTGGAAGGAAT 

GAACCCATATTGAGAGTCCACTATGTGTCAAATTCCTT 

GCATGGAATCTCTAAGGTCTGTCTAGCTTAAAAGCAAT 

GCCAGCCTTGCTATCTGTACTTGATGAGGAGATGGATC 

GGAA 


SEQID 
NO: 45 


DG5S39 


168920224- 
168920577 


CCAAACTGCAAACCCAAACTTCTACAATGAATTCATGT 

GCAACTTATTCTAAAAGATCTATACACACACACACAC 

ACACACACACACACACACACACACACACTTCCTGTCCT 

ATTGCTCTTCACTACTTCCTTCATCTCTGTGCTACAATC 

TGGGTTCATTTTTCTTCCCCTTGAGTAATTTATTATGTT 

1111 ACAGTGAGTCTGTTGCTC A AA AATTCTTTTAGTAT 

TTATTTGTATAAAAAGTCTTAATTTTGTCTTCATATAAA 

ATTTTGTTTGACACTCTATTATAAATTGACTGTTATTCT 

CTTTCCATGTTTTCCGGACATAGTTCCATTGTCTTCTGA 

CTTCCA 


SEQ ID 
NO: 46 


D5S1456 


168968063- 
168968256 


TTCNACCTTATGGGTATATCGAATTGTAACCCCGTTGT 
AGGTCAAGGAGCATCTNCATATATACATACATAGATG 
ATAGATAGATAGATAGATAGATAGATAGATAGATAGA 
TAGATAGATTTAATTCTAAANTTTCCAAATACTCTTTC 


SEQ ID 
NO: 47 
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ATTTAAATGATTATAGTTTTACAACAATTTCATATATT 
NTATAGGTAGGAGAATTAGGGTTTTCCAGAGAAATAG 
ANNCAATAGGCTGTGTGTGTATATAANGATTTANTTTN 
AAGA 




DG5S106 


169021310- 
169021609 


GTTGGGCATGATGGTGTGTACCTATAGTCCTAGCTACC 

TGGGAGGCTGAGGCAGGAGGATCCCTTGAGCCCAGGA 

GTTTAAGACTAACAAGACTCCATCTCTGAAAAATAAG 

GCAAAAAAAGTATGAAGAATAAAATAACAATCACTTA 

CATTCCAACCACCTATAATTAATCATTGCCAACACCTG 

AGGATATTTGCTTCCAATCTACAAGACTGCATTATTAT 

TATTATTATTATTATTATTATTATTATTATTATTATTATT 

ATTATTGAGATGGGGGTTTCTCTTTGTTGCCCAG 


SEQID 
NO: 48 


DG5S40 


169067041 - 
169067434 


GCAGACAGGGTTCAAATTCCAGCCTCCCCTCATATTAG 

CTGTGTGATCTTAGGCAAGTTTATTCATGTGTAAAAAG 

AAATAATAACCTCTTCTCGTGGGGTTGCAGGTTAAACA 

AAAGAGTAGGTATTAAAACAACTAAAAGAGTATGATT 

GGATTGTTTATAACACAAAGGATAAATGCTTGAGGAC 

ATGGATCCCCATCTTCCATGATGTGATTTTTATGCATT 

GTATGCCTCTATCAAAACATCTCATTTACTCCATATAT 

ATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGTGTGTGTATGTGTATGCACACACTATGTGCCCACA 

AAAATTAAAAATTTTAAAATTAAAAAATTTTAAAAAT 

AAACATGCTGCTGGGC 


SEQ ID 
NO: 49 


D5S504 


169142805- 
169143006 


AGCTGCCAGCACACAAGGCCCAGAGGTACTTTATTGG 

ATGCCAAATTCTTTTAACACACCATGAGAAAAGAAGT 

TGACAACTTTCCCATGCATTTTGGAAGGTGTGTTAGAA 

CGATTCAACACACACACACACACACACACACACACAC 

ACACACACACACACATTTATTGGGTTGGGGGAGCCTT 

AAAACTTACAAATCT 


SEQ ID 

NO: 50 


D5S1961 


169173385- 
169173631 


AATTTCTTACCCTTTATCCCCGTCTTCTACTCCCATTCC 
ACATGCCCCCCACCCCCCATGAATGATTTGTCTAATGC 
GTTGAATATGCACGTGTGGTTTTGGGTGCATGTATGTT 
TACACACACACACACAGACTCTCTCTCTCTCTCTCTCA 
CACACACACACACACACACACACACACACACACTCAT 
GCCTCTCCTTTGAAGAGGATCAGATATGGACAGCAAA 
GGGCATTAGCATCTTAGCT 


SEQ ID 
NO: 51 


DG5S108 


169203019- 
169203370 


GCTCCAAAGTTCCTGATTCCAAAGCCTAAGTTAAGCCA 

TCTTAACITATATTTCAGGTGACTAGTGGAATTTTTTAT 

GCCCAGTGTAGGTGGTAATGACGATGGAGATATGGCG 

ATGATGATGATGATGATGATGATGATGATGATGATGA 

TGGCAGTAGTGGTGGTGGTGTTGGTGATGGTGTTGAA 

GATCACCATGTCTGACCACTGTTGTCTGTGTCCTTTGT 

ACAATTGTTCAGATGCAGTGTCCTGGTTGTCACATAGA 

TGTCTCTGACTGTTTTACAGGCCTTCACCTACCACCAT 

ATCCAGGAGATCATGGTCCAGCTGCTGCGGACAGTGA 

ACCGGACAGTCA 


SEQ ID 

NO: 52 


DG5S41 


169277936- 
169278300 


TTAACCCACTGCTCCCATTTTGCTGGTGGAAAACTGAA 

AACCAAAGAGATTAAGTCATTTACCCAAGGTCATGTA 

ACTAATATATTGAATCTCAGATGTTTTAATGATTTTGA 

CTCATTTCCAATTTGCCTGGCTATATAGAGAAAATATT 

TGAGGAATTGACAGGGAACACACACACACACACACAC 

ACACACACACACACACACACACACAGAGGGAGGGAG 

GGGAGAGAGAGAGACAGAGACAGACAGAGACTGAAC 

AGATTATTTCTCCACTGATGTTCATTATTTAGATCTATT 

TTCAACATTTAAAGGCAATTGTCAGCATAGTCAATTCA 

GCCATTTTAAACCATCAAGGGCCAATG 


SEQ ID 

NO: 53 
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DG5S42 


169285983 - 
169286144 


GCAACCTATTTGTTAGCAGCACATGCGTGCGTGTGCAT 
GCACGTGGGCACACACACACACACACACACACACACA 
CTCTTTGCAGGGGAATTTTGAGCCAGAAATTTATCTGT 
AGGCCCATATTCATTCCTTTTTGCACATTTCTATTGTGA 
CCTTGGGCA 


SEQID 
NO: 54 


DG5S10 


169356049- 
169356318 


CAATTCCACACAGCTGGAGAGTAACAAAGCCAGAAAT 

CACATCCATTTGTGTGTGTGTGTGTGTGTGTGTGTGTC 

CAAATCCTGTGATTCCAGTGCCAGGATACACTGTCTTC 

CGTGTTC AAC AGTC ATG AAAGTA lTl'l AATGA AC ACCT 

GGCCCTGCAGTGCCTGATGTAGCAAATGCTGCAGATA 

CTCCACCCACCGACTCTTGGACCACCCAAAATCCACTG 

GCAGCTTCAGTGAGGCTTTCCTACTrCITTCTTTCCCTG 

GGCT 


SEQ ID 

NO: 55 


DG5S110 


169391090- 
169391341 


CACTGGACTTGGAGTCAGGACATCATTTTAACAGCTTT 

ATCGAGATGTAATTTACATGCCATACAATTTACCCAAA 

GTGTACAATCATTGACTCTTAGTATGTTCACAGAGCTA 

TGTAACCATCACCACAATCAATTTTACAACATTTTCAT 

TACTCCTAAAAAGCAAACCTGCACCCCTTAGCCACTGC 

TCTGCCAACACACACACACACACACACACACACACGC 

GTGCGCACACACACCCAAACACTC 


SEQ ID 
NO: 56 


DG5S11 


169409260- 
169409401 


CGCGTGTACCTCCACATAGATGTTTGCCAATGCCTATG 
CCCAAGACACACACACATACACACACACACACACACA 
CAGGATACATTCAAGCACACACTAATGTATGTGCACTT 
GCCTGCACAGAGTCCACATCACACAGGC 


SEQ ID 

NO: 57 


D5S1973 


169424532- 
169424861 


AGCTTCTAGTCTGCTATGTTGCTAATTGTCTTCTTGGTC 

ATCTTTTAAAACCATTTCTGTGAAATTATAGCCTCCTT 

ACTCCCTTACCCTGAGTCTGGATGTTTCTGAAGATGAC 

TGATCTCTACAGTGAGAAGGCCCTGGGAATTGACTGA 

CTCACTCTCTCTGTCTCTCTCTCTCTCTCTCTCACACAC 

ACACACACACACACACACACACTCATATACATACACA 

CATAGATACACATATACATGCATCCACACATGCACAC 

CCTGGGCACACCCACACACCCTACAACTGCACATGCA 

TGCACACACATAATGTTAACTGAAGG 


SEQID 

NO: 58 


D5S397 


169542970- 
169543287 


AGCTTTTGGCTATGGAACCTTAGGCAAGATGTTCATAA 

ACCCTTTAATCTCTAGTGCCCTTGTTCATAAAAAGAAG 

TGAATCGGATCCCTGCAGGACTGTTTTTGTATTCAGTG 

CACAGGTGTGTGTGAAGACACCCAGCATGTTGCCAGG 

CACACAGAGATGTCTACCTTGATACTTTTCTCTCCTCCT 

CCCCGCAAATACACACACACACACACACACACACACA 

CACACTCACACACTCTTATTTTGATCTTGGCCTGAGGC 

TGACAAGCCCCAGATTAGTGATCAGTGACAATTTCGG 

CTTTATCAGCT 


SEQID 
NO: 59 


DG5S115 


169586308- 
169586550 


CGAGGATGCACACCTCATTAATTGAGGAGCTAGGATT 

TAGATCCAGAGCCTCATGATTCTAAAGCCTGTTTTTTG 

TTTGTTTGTTTGTTTGTTTTGTTTTGGCCACACTAGGTT 

TCTAGAAACTTCCAGTTCCTTCTTAAAAGTCCTTTTTGG 

GCATTCCGGCCTAAATCCCAAAACTGTGGTCTGGGTAC 

AAGAGAGAATTAGGCCAGTGAGAAAAATTTAAACCAC 

CCTGCCCTCTAAAT 


SEQ ID 

NO: 60 


DG5S888 


169653226- 
169653848 


CTTGATGTCTTCTTTCACCATCCCCTGGCACCCGCCTAT 

ACATTTATTACTTGAAACAGACTGACCTTTATTTGGTT 

AGGCCACTGTGGTCAGGTTTCTGCAACATGGGGTCAC 

ATGCCTTCCCAACTGACACAAGTCTCAAGCTCCTTTTC 

TCTTCTTTTT AT A ACTTCT AG A A GC AT A GCTTCT ACC A G 

ATAAGGATCTAACCTTTTCAGTGGAAAACAAAAATGG 

CAAAGAAGTAAAGAAAGAAGAGAGAGAAAGAAGAAA 


SEQID 
NO: 61 
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GAAAGAAAGAAAAGAAAGAAAGAAAGAAAGAAAGA 

AAGAAAGAAAGAAAGAAAGAAAGAGAGAAAGAAAG 

AGAGAAAGAAAGAGAGAAAGAAAGAGAGATGGAGAG 

AGGGAAGGAAGGAAGAAAAGAAAGAGAGGGAGAAG 

AAAGAAGACAGGGAAGGAAAGGGAAGGGAAAAGAG 

GGAAGGGGAGAGGGGAGGACAAGGGAAGGGGAGAG 

GGGAGGACAAGGGAAGGGAGGAAGGAAGAAAGGAA 

GGAAAGAAGGCAGGAAGGAAGGAAGGAAGGAAGGA 

AGAAAGGAAGGAAGGAAGGAAGGAAAAATAACTAGG 

GCCTTTCACTTTTGCCTTCAATAGCAGAGTGGCCCTGG 

ATAT 




DG5S44 


169661202- 
169661574 


TATTGGCAGAGGGTGAGTCCAGTGTATAAAAGCAACT 

ATATTTGTGCAATAAGGCAACCTCTAAACACAAGTTAC 

TACTTCATCTAATGCCACACACACACACACACACACAC 

ACACACACACACGAGTCATCTGTTCCAAGGCTGTTGCC 

TTTACTAAGTGATGCTATGTTGGTCCTTGAGGTGGTGC 

CTTCCTGAGGGTTTTCAAGCATAGCTTTGGCCATGCAC 

AGTTTTCTTCTTATACACACTCTGAGGAGCCCCGCCGT 

CACGGTAATGCACCTGCCTCACAAGCTGGTGGGCAGC 

TTAAATGAAATACACATTTTGCTCCAGGCCCAGCACTA 

GCTCATCAATGTGAGCTGGTGTTAGCCTCACC 


SEQID 
NO: 62 


DG5S45 


169693772- 
169693912 


CAGTAGCCAGGAAGCTGAGGAACACACACACACACAC 
ACACACACACACACACACACACAAACACACCCCTTCC 
TGGCTCCAGTTCCGCACCACCCCACACCCCCAACACCG 
GAAGTAGATTTCTCAATAGGCAGGGCTG 


SEQID 
NO: 63 


DG5S46 


169702377- 
169702678 


TTTGCCAGAATGTCCTCACACCAAATAGTGGACCCCTT 

CITTTGCTGATTTATCTGCTATTGTATAGGTGTATGTGT 

GTGTGGGTGTGTGTGTGTGTGTGTGTGTTAAGGCAGGT 

GGTAGTATGTGTAGGGTAGGGTTTCCCCAGTCACCTGG 

AGCCCTGAGTGCCTGCTTCCCTAAACTAGGCCAGTTTA 

GCTGACTGGCTTCCTTTGTGTATTGGTCCATTCTGCATC 

AAAAGCATCTGAATTTTCATTCAATCTCTCTTCTGAAT 

TTTCACTTTTAAAAACCTGACCAGTCCCTTGTG 


SEQ ID 
NO: 64 


DG5S47 


169788696- 
169788899 


CTCCTCCATGGTAGGGACTGGTTCTCTTAGGCCCCTGT 

ATCCTCAGGCCCAGCATGCTTGGGAAAATGTTTGCTAA 

TGCTTTGTGACTCAAAAGGAATCACACACACACACAC 

ACACACACACAAACACACACACACAGTTTTTAATATT 

ATCAGTCATATCAGCCCCCTGAGGCAGCTGCTCTGTTC 

CAGACAAACCCTGTT 


SEQ ID 

NO: 65 


DG5S119 


169843903- 
169844041 


GGGTACAGGAGAGTTGTGGTGGGCATTAGTACTACTC 
CTGCTGCTGCTGCTGCTGCTGCTGCTGTGTCCACTGTT 
AGTGACAGAAGTGGGAAAATATTTAAGTTGAGTTCAC 
ATTAGTGTTCCCAGTTTAGCGTGAGC 


SEQ ID 
NO: 66 


DG5S953 


169866165- 
169866415 


CCATGAGTTCAGGCAGTGGGTTAAATAAGATTTCCCTT 
GAAGTCGAATGAAATCACAATGCACCACACACAGGGA 
CACACACACACACACACGCACGCACGCACATCACACA 
CACACACACACACACACACACACACACACACACATAC 
ACACACACAGTCTCCCTGGGGCCAATCTACTGCCCCCT 
GAACCTCACCCATCAGCCAGGTGCCTGGCCCCGGGTCT 
GTCTCTTAGGGTTACATGCTCCCGG 


SEQ ID 

NO: 67 


DG5S955 


169951970- 
169952619 


ACTTATGGAACACCTACTCAGTGCCAGGTATTGTTGTA 

GATGCCAGGAGTACAGCAGGGAATAAAACAACATCCC 

TGTCCTCGACACAAACACACAAGTAAATAGAGAAGGT 

CAGAGATAAATGCTGTGCAGGAAAACAAAGCAAAGTG 

AGGGATGGAGAGTGCGGAAGGTTGGGGCACTTTTGTT 

TCAGATGAGTGTCAGGGAAGCCCCCTTGGAGGAGGCA 


SEQ ID 

NO: 68 
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CTGTAAGGGCACAGAATCGAATGAAAGGAGTATGTGA 

AGGTGCTTAAATTGTTTCTGTTTGGTTTGGTGTGGTGT 

GATGTGGTGTGGTGTGGTGTGGTGTGGTGTGGTGTGGT 

GTGGTGTGGTGTGGTGTGGTGTGGTGTGATGTGATGTG 

GTGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGTG 

GTGTGGTATGGGTTGAGGCTGGCCTTAGGAGCCTGTTG 

GCCTTCCAGGCCAGTCCTGAAGCCCAGCCCAGAGCAC 

CAGACTCTGCAGTCAGTCAGTGGAGGGCCCACATCTC 

AGCCAATGCATGGCTTTGGGTGGTGACTTCATCTCCCC 

TAGTGTTCCTTTCCCCCTCTGCAAAATGGGAATGGGGA 

TGGCTCAGAACTCCCAGCGGGAGTTAGGAGGAATAAT 

GTATAGGAAGTATGAGCAGAGTGCCTGG 




DG5S13 


169961410- 
169961530 


TGATGTGCTCGTTCCCATAGCCCCGCTGTGTGTGTGTG 
CGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTTTGGTGGGGTGGGAGGGGAGGCAGAAGAGGAAG 
AGAGGGCA 


SEQ ID 
NO: 69 


DG5S123 


170015858- 
170015997 


TGGTGATCAGCTCAGTGTCCTTGGAAAAGAGCAGAAA 
GTGGTATCACGAACATATCITCTCCTTTGCrTCCTTCTC 
CTCACTCTTCATCATCATCATCATCATCATCATCAAAT 
ATGGATCTGTGAGGCTACCTCTGGG 


SEQ ID 

NO: 70 


DG5S124 


170041996- 
170042336 


GGAGGAGAGACCAGCATTCACATTCAGTTATTGTTGTT 

TTAAATCCATTACGCACATACATAGGAGAAAATTTCA 

GCAACAGTCACCCTCTGAACCCAGTTCCTCAGTTCTCT 

CCAGAGGCAACTAAAATGCTCAATTATTAGTGTATCCT 

TTTGGAAATATTTTATGTATATGACAGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTTCCTTTCCAATATTA 

AAATAATATTAACATTGGTAATAGTGGTACTAAACAA 

CTrAGGGTGTTTTTTTTTTCATTTAATAGTATATTTTTA 

GTATCTTTCCAGGAAAAGATACATGGATGTGCCACA 


SEQ ID 
NO: 71 


D5S625 


170105556- 
170105787 


TCAATAAATGATTCTGGGGATGTGTCTGTCTGTCCATC 

TGTCTCTCTCNAGANACANATACACACACACACACAC 

ACACACACACACACACACATCCTGTTAGTTCTGTTTAC 

CTGGAGAACCTTGACTAACATACCCATTAAAACCAAA 

ATATGTCCTTCAGGGTGTTAATGTTTGGTTGAAGAAAC 

ACAGAAGTTTAACAATTGTATCAGGCTGGGCACGGCC 

TATAATCCCAGCATTTTGGGAGGCCACAATGAGNGGA 

TCACTTGAGCCCAGGAGTTCTAGACCAGCCTATGCAAC 

ATAGTGAGACAAAAAAATGAANAAAATTAGGGGTGTG 

GTGGAGCGCACCTGTAGTCCTAGCT 


SEQ ID 

NO: 72 


DG5S959 


170167429- 
170167616 


GAGTTCTATGGAACAGCATTTATTGAATAATAACATTT 

CAGGAAAAAATATAAGCTTTACTGTATATTAAAATAC 

ATATATACGTTTATATATTATATATTATATTATATTATA 

TATTATATATATTATATATATTATAATATTTATATATTA 

TATAGATATAAATCAACTACAAGATCCAGTTCAA 


SEQ ID 

NO: 73 


DG5S960 


170203240- 
170203459 


TTGCCTAAGATCTAGGTGTTCAAAAGAATTTCTCCAAT 

CCACTTCAGCCTGTTATTATGTATGTATTCTGTTTTAAA 

TGAGAAACAAGAGTCATTTTCTCCAGAATAATAGAAC 

CATAGTGACACTTGAAGTAAGTCCAGTGGTCCTGATAT 

GATAATAATAATAATAATAATTATTATTATTATTATTA 

TTATTTTGAGACGAGGTCTGTATCTGTTG 


SEQ ID 
NO: 74 


DG5S16 


170280782- 
170281084 


ATGGAGAGACACGGAGTTGCTTGAGGGTACAGTGCCT 

GTAGATACTCAATAAATATTTGTTTAATTAAGAAAATT 

TCTGTTATTTGTGTGCTCATACATACCATTTCAGTCTGG 

TGAGTATTGTTCTTTCCTAGAGTTTACTTTTAATCTTAA 

GTATTTTCCAGGTCCTTTGTTGACTTCTGTTTAAACCAC 

AGTACACACACACACACACACACACACAACTTTTGTG 


SEQ ID 

NO: 75 
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TACTATAATAGCTTCCCCAAAATTATAATTTAGTCATT 
GTGATGCAGATCirCTTCCAAGGCCTCTACITTGG 




DG5S962 


170338421 - 
170338789 


AAACAAACAAACAGAAACACCAAAATGGATTCCAGCA 

TCTTATAAGTGCTTTCTCTTATGATCGAGAGTAAGACA 

AGCATGGCTACTCCCTTCTCTTCTATTAAATATTGTACT 

AGGGGTTCTATTGAGATAAATAGGCAAAAAACAAAAC 

AAAACAAAACAAAAAGGCATCCAGATTTAAAAAAAA 

AAGGAATCTAGGAATAAAGGGATTACATCTCTACTTG 

CAGATGACATGATCTTATGTATAGGAAATCCTAAGGA 

TCCACTGAAAAACTGTTAGAACTAATAACATCAGTAA 

GTTTGCAGGATTATAAGATTAATACAAAAACTCGACT 

GAATTTCTGTGCACTTGCAATAAACAACCCAAA 


SEQID 

NO: 76 


DG5S132 


170442700- 
170442947 


TCTGCCCACACACTTTATGCTTTAAAACAAAAGGCCAT 
GTTGAACTTGTAGAACCAAATGATTGCTAATTACTTGG 
GGCGATACTAGTGATATATTATCTTACATACACACACA 
CAAAACACACACACACACACACACACACACACACACG 
GCTTGAGTCCAGCATGGCCTACTGATTTTAAAATAGGA 
AATGACAGTGTAAATGCCAGGATAAAGGACAAAGTGC 
TCTGACCTGTTGCCAAACCTT 


SEQID 

NO: 77 


DG5S136 


170469573- 
170469843 


GGGTTTAGGACCCACGTATCTTTTGTTTGTTTGTTGTTG 

TTGTTGTTGTTTTGGAGTTCAACATGTTTATGGTGTGTA 

GCCATGGTTGAAAGCTTTTATTTTATAAGATAGACAAA 

GCAGGAATTATTATTCTCATTTTACAGGAGAAAAACA 

GAAGGGCTATGTGGTTTGTTAAAAGGCCACACAGACA 

GTAAAGAACACAGCCTTTACATGGTCAGCCTCACATTC 

TAGTACTCATTTTATTACACTGCTCTTCTTCTCTGTTGC 

CTG 


SEQID 
NO: 78 


DG5S133 


170480360- 
170480621 


CTGGCCTCTTTGCCCATTTTCTAATTGGATTATATGTGG 

GGTTTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTT 

TGCTGTTGTGACGTTAGAGTTCCTTGTATATTCTAGAT 

ATTAATCCCTTGCCAATTAATAGTTTGCCAGTATTTTCT 

CCCATCCTGTAGGTTGTTCACTCTGATGATTGTTTCCTT 

TGCTGTGCAGAAGGTATTTAATTTGATATAATCCCATT 

TATTTACTTTTGTTTCTGTTGCCTGTGA 


SEQ ID 
NO: 79 


DG5S17 


170499980- 
170500284 


CCATCCAGGGTCTAACTCCAGCATTTGTATAAACTTGG 

AC A ATA C 14 T 1 GCT A C AGGGTTGTCATTG A AAGT ATTG 

CCTCATTATATTTCTTAGTGGTCCCTGTATGAAGCCAT 

ATAAGAGAAACTTCTTAATTTAGCACTAGGAAATGCTT 

CTGTTGACTTGAGATGTGTGTGTGTGTGTGTGTGTGTG 

TGTGTGTGTCTGTGTGTGTGTGTGTGTGTCTGTGTGTGT 

GTGTATTCCCCTAATTGATAAACTATAAAATAATCTTT 

CTCTTTTCACTTTGGCCATCTGGAAATTTGCCACCAA 


SEQ ID 
NO: 80 


DG5S137 


170644993- 
170645364 


TGGCTTCCCAATCCTAGAAAAGGAAGAAAGCTGCATG 

TTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGT 

TGTTAGATGGAGTTTTCTGTCGCCCAGGCTGGAGTGCA 

GTCACATGATCTCAGCTCACTGCAGCCTCTGCCTCCCT 

GGTTCAAGCAATTCTCCAGTCTCAGCCTCCCAAGTAGC 

TGGGATTACAGGTGCGCACCACCACTCCAGGCTAATTT 

TTTGTATTTTTAGTAGAGACGGGGTTTCACGATGTTGG 

CCAGGCTGGTCTTGAACTCCTGACCTCAGGTGACCCAC 

CTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTG 

AGCCACCGCACCTGGCTGAAAGCTGCAT 


SEQID 
NO: 81 


DG5S53 


170673106- 
170673364 


GCCTTCGCAGATTGTACCTCTTCCTTTCACCCTTCTCGC 
TGGCCTGTGCTTCTCTCTCCATCGTGGTCTCCCACGCCT 
TGGTTTCTCCTCCATCCCCATCCCCATCTTTCGTGAGCC 
CCTCCAACTCTCTCCCCGTGTTTTGTACGGTCTCCTGCG 


SEQID 
NO: 82 
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TTCACTTGATTTCCTCTCACCCACCCCCGCCCCAAACA 

CACAGGCACACACACACACACACACACGCGCACACAC 

ACGGGCCTCTCGCACTCTCCTTCTCCT 




DG5S968 


170675807- 
170676033 


TGACTCTTGGCCTCTGTGTGTCTCTGGGTTTCTTTGTCT 

CCCTCCTCrCCACGGTCCTCTTGTCCTTTTGTCTTCCCT 

TTCTTGTTTCTTGAATCTCTTTGCCTTTATGTATCTGTCT 

TTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 

TCTTTCCTTTCTCCCTGACTCCCTCTCTCCCTCTTCCAG 

GCCCAGCTCCCAGTAGCTCCTAAGGCAAA 


SEQID 

NO: 83 


DG5S904 


170735417- 
170735632 


CATTTGGGATAAATGTTGTCTTTAGTTTTCAACTACTTT 

TTCTTTGGCTTATTCTCrCTCTCTCCCrCCCTCCCTCTCT 

CTCTCTCTCTCTCTCTCTGTGTGTGTGTGTGTGTGTGTG 

TGTGTATTCATGTTTTCTTAATCTATCTGAATTGTTGTG 

TCGGTTTTCCATGCGAATTTCCAGTTACCTCCACAGTA 

TTCGTTTCAGAATGCTTCCT 


SEQID 
NO: 84 


DG5S906 


170820130- 
170820505 


TTGCAGTTTCATGAACCAAGTATTACTGCCTCAACAAT 

TAAAAACAACAGACAAATTATTTAAAAAACCATGAGG 

CGAGTGGTGGCTGGTGGCTGGTGGCAGGGCGGGGGCA 

GGGTGGCCTCTGTGTCTCATGCTTTCTGGTTGGTCTGT 

GGTCTTTGCACTGAGAGCTAGGGCCTTGCACATTCATT 

CATTCATTCATTCATTCATTCATTCTTTGAATTCAACAT 

TACTATGCACCAGGCGCTGAGAAGGCAGCCTTAGACA 

GATGGAAATCCTTGCTTTCCGGGAGATTCCATTCTAAT 

GGGTCATTGATTCAGTGGCCTCTTCAGTCATTTGTTCA 

TATGCATTTACTCGTACCTCTCATGTGCCA 


SEQ ID 
NO: 85 


DG5S141 


170910447- 
170910786 


AACTGAACCTGGGCTGTGTCAGTCTAAGACTTATGCTT 

GGAACCTGTAAGAAGAACAAGTGTGCGTGCATGCATG 

TGTGTGTGTGTGTGTGTGTGTGTTTGGAGGGATAGTGA 

AGTTTTn'CCTACAGCTACAAATAGAACATGCTTTCCT 

ATACAATTGTACTAATCAATATTATTTCCTTACATTATC 

TCCAGCCATTTCCCTATAATTAGACATTCAGATTATTT 

CAAGGTTGTTATTCCTATAAACAGTGATGTGATGAATT 

TTTTAAAGTTGGTTCCTCACATCCGTCTGTTCTTGTAAA 

TGTATCCATCATAGAACTGGACCACAAAGGTTGG 


SEQ ID 

NO: 86 


DG5S909 


170941109- 
170941259 


GGGAGTTCTCTTTCTTCAAGTGCTTAGGGGAGAAAATA 

AGTGAGAAAAAGAGAGATAGAGGAAGGAAGGAAGGA 

AGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGA 

AGAAGAAAAGGAAACGAAAAGGAAAGGAAAGGAAG 

AAAGGA 


SEQID 
NO: 87 


DG5S910 


170946679- 
170947010 


TCCATGTTATTCTCTACCTGTTGGTTCCTTCCTCATTGA 

AAATTGGTGTATAGATGGTAGATAGATAGATGATAGA 

TAGATAGGTAGATAGATAGATAGATAGATAGATAGAT 

AGATAGATAGATAGATAGATAGATTTTTATTTTTGGTC 

TATCTCCTTTACTAAACAGTAAGCTCCATGAAAATATG 

GATCATCACTGTCTTATTCACCATTATATTCTCAGCAT 

ATGGTATTGTCCTGGTATAGAATAGATTCTCAATAAAT 

GCTTGCTAAATGAATGCATTCATGAGTGAGTGAATGA 

ATGAATATGCGAGTGGATGAGTGTGTGGA 


SEQID 
NO: 88 


DG5S911 


170985696- 
170986066 


TGCTTGAGGGCAGGATCTATATGTAATTCCTTTCTGGA 

AAACCAGGGATTGAAACAGGATCTGGCATGCAATGGG 

CTGGATGGGTGAATGGAGAAATAGATGAATGACAGAT 

GGATGGACAAACAGATGGAAGGAAGGATGGATGGAT 

AGATGGATGGATGGATGGTTGGATGGATGGATGGATG 

GATGGATGGATGGATGGATGGATGGATGGATGGACAG 

ATGGATTGGTTGGTAGATGTGTGGATAGATGGATGGG 

TGAACAAGCGAGTAGATGGATGAGTAAATGGCTAAAT 


SEQID 
NO: 89 



2345.2049-005 
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CTGGTGCTTTTCTTCCAGAATCCTGGATTCTGAAGGGA 
GGCTTTGCAGCCCTTCCTCGTGGATCACTTGCTCTG 




DG5S143 


171018986- 
171019237 


GGCTGAATTACTGGGCATGTTTCTGAGAAGAAAGAAC 

TTCTATTTTAATTATATATCTACAGAAACCAAATTGCC 

TGCTTACAGTTTTACATGTCTGATGATTGGAAGTTTTT 

GTTTGTTTGTTTGTTTGTTTGTTTGTTTCCACAGACTAG 

CCTCTGACTCCATATATTTCAAACTTTGTTCCTCTTCCA 

CTACCCACATATTTCTGATGTGAGACATTCTAGAAAAA 

TTTCATATTGCAAGACGGCTTC 


SEQ ID 

NO: 90 


DG5S513 


171039003- 
171039366 


TTGGCAGGATTCAGTTCCTCATGGGCACCAGACGGAG 

AGTCTCAGCAAATCACTAGCTGGTGACTGCAGCCACC 

GCAGTTCTTTATCACGTGGTTCTCTCCATAGGGCAGTT 

CACAGTGTGGTAACTTGCTTCATCAGAGCAAGCCAGG 

AAGAAGACCCAGAGACAGACAGAGAGAGAGAGAGAG 

AGAGAGAGAGAGAGGAGGCCTCTGAAAGAGAGAGAG 

AAGAGAGAGAGGCAAAGAGAGAATGAGAACTCCAGA 

AGTCACTGTCTTTTATAGCCTAATCTTGAAAGTGACAT 

TCATCACTITCACTACATTTTCTTCCCCAGCAGTGCTCA 

GTGGGAGGGGATTATACACGGCCATGGAT 


SEQ ID 
NO: 91 


DG5S145 


171040948- 
171041151 


GCACATTTGCAGAGGTTTGAGGTCCCATCATTAGCCAT 

GCTTCTTGGTTCCTGCACTATGAGTATACGTATGTGGG 

CTGATGGCCTCATTCACTGGATACACACACACACACAC 

ACACACACACACACACACACACACACCTCACCAGGGA 

CTTGGGAGTATCTAAATGTTTGAGAATCATAGAGCAG 

GGAGACATCCAACAC 


SEQ ID 

NO: 92 


DG5S146 


171073796- 
171074122 


GGGCACATACAGCTTTCCTTGCAGGAAAAAAACCTGC 

TTAACTTTGTTTATTATATATTATTTGATCTGTGCTTCA 

TATATTATTCATATATTATTTGATCAAGTTGCTTCATGT 

ATTATTTGATCAAGGAATCATGTGTGTCTACAGCACCT 

ATTAAAATTCCCTGGCACTGAAATTCTGTAGAAAACCA 

TTTAGGAAAAGTTGATCTAACTGTATAATTATTAGTAA 

AACATATACACACACACATACACACACACACACACAC 

ACACACACACACCACAAGCAAACAAAAAAAAACCAC 

CTTAATGGTCTCCTAACCAAGGCA 


SEQ ID 

NO: 93 


DG5S147 


171107565- 
171107831 


GCATGTTCGCCACAGAGATTCAATTAATTTAAATAGGT 

AGAGGACTTGGGGCAGTGCCTAGGACAACATTACACT 

CAGGGATGGTGATGATGATGATTATAATGATGATGAT 

GATGTTGATGATGATGATGTTGATAATGATGATGATGA 

TGATGATGATGATGATCATGATGATAATGGAAAAGAA 

GATAGAGGAGGTAGAAGAGGAGACAATCATGATGTTG 

GAGGTAGACTCCAATCTTCAGAATCAGAAGCTCAGGG 

TTGGA 


SEQ ID 
NO: 94 


D5S462 


171134297- 
171134396 


AGCTTAATCTATTATTTNAGAGGCAGAAAGTTAACTTG 

CTTATCCTGAAAAGAAGTGCAAATATATCCCAAAAGT 

GCCATTCTTTCATTCATCCACTCAAACAGATACACACA 

CACACACATACACACACACACACACACACACCCTTTTC 

ACCCCTTGGTAGTGTACAGTCTCTGAGTTGTAAAAAAT 

AGTCATTNCTTTCTGCTTGAAAGACTGTATTAGCT 


SEQ ID 
NO: 95 


DG5S148 


171140975- 
171141303 


CCAGCATGATCCTATGAATCCTTATAAAAGGGAGATG 

GGAAAATTTCACACACACACACACACACACACACACA 

CACACACACACACACACAGATAGAGACAGAGAGAAG 

AGGATAAGGCAATGTAACCATGGAGGCAGAGATGGG 

AGTGCTGTAGCCACAAGCTAAGGAATGCTGGCAGCCA 

CAGATGCTGGAAGAGTTGAAGAATGGGTTCCCCCTGA 

GGGAGCACAGCCAGATGCATGCTTTGAGAGTTCAGCC 

CAGTGCTACTGACTTTAGACTTATGGTTTCCAGAACTA 


SEQ ID 
NO: 96 



2345.2049-005 
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CAAAAAATTAATTTCTGTCGTTTCAAACCATCC 




DG5S914 


171219902- 
171220159 


CAAACGTCGCTGACCTGAGTCTGACCTGGGCTGCCTCG 

TGTTACCAACATGAAAAGGGAGTGAGAAAATCTGAGG 

CCAATTAACirCTCTCCCTCTCTCTCTCTTTTTCTCCCCT 

TGCCCACCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC 

TCTCTCTCTCTCTTTTTTCCCCCTCCTCTTCTTGGAGAC 

ATGATGAAATTTCCTGAAACAAAAACTCGCAGCCCGT 

TCAATAAAATGCTTTCGCCTTTGGTG 


SEQID 
NO: 97 


DG5S150 


171232854- 
171233077 


ACAGTTGCCATTTGCTCATTTAAAATGTAGTGAGGTGT 

TTTAAAGAGGGTTTGTTCAATTTACCAAAAAGGGAAA 

AAAAGGGAAAAGAAGAAACTTATTGTTGAACGAACAC 

ACACACACACACACACACACAAAGAGCCTGGCTTAAT 

TTAGGGATAAAGCAAAGAAGTCAATACCCCCACATCA 

ACTATTGAAACCTAAGCTATTGCTGGAGTTGACAGCG 


SEQ ID 

NO: 98 


D5S429 


171276128- 
171276490 


AGCTCTN CCTAGC ATTGTTTTCCTTTGCTTC ATTTCCTC 

TTAAATGTGTTGGATGCACTTNGTTCCTGCTAACTAAT 

CTATCTTTNCAGTTTCAAATCAAATGAACCCAGAGAAT 

TTATTTTTACATTATTATCTTCAGATTTAGATTTGTTTT 

GCTTTTAATCCTGTCTTCATGAAGGGGAAAGCCATGTG 

TACCAGCATGGTTGATAAACCACCAAATCGTGAAACT 

TTGCTTGCTCCCCAAACCCCCAACCACACACACATACA 

CACACACACACACACATACACACACACACACACACAC 

ACACACACACACACACACAACCTGGGAAATTGGGNAG 

AAAACTGGCAAACCTTAAACTAG 


SEQ ID 

NO: 99 



5 



15 



2345.2049-005 
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Table 7 The DNA sequence of the microsatellites employed for the association 
studies across KChlPl (including Build 33 locations). 



NAME 


POSITION 


SEQUENCE 


SEQIDNO 


DG5S1173 


169653708- 
169653840 


AAGGGAAGGGAGGAAGGAAGAAAGGAAGGAAAG 
AAGGCAGGAAGGAAGGAAGGAAGGAAGGAAGAA 
AGGAAGGAAGGAAGGAAGGAAAAATAACTAGGG 
CCTTTCACTTTTGCCTTCAATAGCAGAGTGGCC 


SEQ ID NO: 
100 


DG5S44 


169661202 - 
169661574 


TATTGGCAGAGGGTGAGTCCAGTGTATAAAAGCAA 

CTATATTTGTGCAATAAGGCAACCTCTAAACACAA 

GTTACTACTTCATCTAATGCCACACACACACACAC 

ACACACACACACACACACACGAGTCATCTGTTCCA 

AGGCTGTTGCCTTTACTAAGTGATGCTATGTTGGTC 

CTTGAGGTGGTGCCTTCCTGAGGGrrnCAAGCAT 

AGCTTTGGCCATGCACAGTTTTCTTCTTATACACAC 

TCTGAGGAGCCCCGCCGTCACGGTAATGCACCTGC 

CTCACAAGCTGGTGGGCAGCTTAAATGAAATACAC 

ATTTTGCTCCAGGCCCAGCACTAGCTCATCAATGT 

GAGCTGGTGTTAGCCTCACC 


SEQ ID NO: 
101 


DG5S45 


169693772 - 
169693912 


CAGTAGCCAGGAAGCTGAGGAACACACACACACA 

CACACACACACACACACACACACACAAACACACC 

CCTTCCTGGCTCCAGTTCCGCACCACCCCACACCCC 

CAACACCGGAAGTAGATTTCTCAATAGGCAGGGCT 

G 


SEQ ID NO: 
102 


DG5S46 


169702377 - 
169702678 


TTTGCCAGAATGTCCTCACACCAAATAGTGGACCC 

CITCITTTGCTGATTTATCTGCTATTGTATAGGTGT 

ATGTGTGTGTGGGTGTGTGTGTGTGTGTGTGTGTTA 

AGGCAGGTGGTAGTATGTGTAGGGTAGGGTTTCCC 

CAGTCACCTGGAGCCCTGAGTGCCTGCTTCCCTAA 

ACTAGGCCAGTTTAGCTGACTGGCTTCCTTTGTGTA 

TTGGTCCATTCTGCATCAAAAGCATCTGAATTTTCA 

TTCAATCTCTCTTCTGAATTTTCACTTTTAAAAACC 

TGACCAGTCCCTTGTG 


SEQ ID NO: 
103 


DG5S1178 


169745438 - 
169745539 


GTGCTCAATGGCTGTTGAATAAATAAATGAGAGGA 

GGAAAGAAGGAAACAAGGAAGGAAGGAAGGAAG 

GAAGGAAGGGAGGGAGAGAGGGAGGGAAGGAGG 


SEQ ID NO: 
104 


DG5S47 


169788696 - 
169788899 


CTCCTCCATGGTAGGGACTGGTTCTCTTAGGCCCCT 

GTATCCTCAGGCCCAGCATGCTTGGGAAAATGTTT 

GCTAATGCTTTGTGACTCAAAAGGAATCACACACA 

CACACACACACACACACAAACACACACACACAGT 

TTTTAATATTATCAGTCATATCAGCCCCCTGAGGCA 

GCTGCTCTGTTCCAGACAAACCCTGTT 


SEQ ID NO: 
105 


DG5S1592 


169794522 - 
169794686 


TTGAGCTGTTTGGCCTCAATGGCATTTTATCTCTCT 

CTCTCTCTGTGTCTCTCTCTTTCTCTTTTTTTTTTTTT 

CACATTGAGCCATCTTCTTACAGCTGAGGTTTTCAT 

ATAAAAAAGCAAGTTGCTGGTTTCTCTTTAAAAGT 

AGGGCAATCTGGCAGTTCT 


SEQ ID NO: 
106 



2345.2049-005 
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DG5S119 


169843903- 
169844041 


GGGTACAGGAGAGTTGTGGTGGGCATTAGTACTAC 
TCCTGCTGCTGCTGCTGCTGCTGCTGCTGTGTCCAC 
TGTTAGTGACAGAAGTGGGAAAATATTTAAGTTGA 
GTTCACATTAGTGTTCCCAGTTTAGCGTGAGC 


SEQ ID NO: 
107 


DG5S955 


169951970- 
169952619 


ACTTATGGAACACCTACTCAGTGCCAGGTATTGTT 

GTAGATGCCAGGAGTACAGCAGGGAATAAAACAA 

CATCCCTGTCCTCGACACAAACACACAAGTAAATA 

GAGAAGGTCAGAGATAAATGCTGTGCAGGAAAAC 

AAAGCAAAGTGAGGGATGGAGAGTGCGGAAGGTT 

GGGGCACTTTTGTTTCAGATGAGTGTCAGGGAAGC 

CCCCTTGGAGGAGGCACTGTAAGGGCACAGAATC 

GAATGAAAG 

GAGTATGTGAAGGTGCTTAAATTGTTTCTGTTTGGT 

TTGGTGTGGTGTGATGTGGTGTGGTGTGGTGTGGT 

GTGGTGTGGTGTGGTGTGGTGTGGTGTGGTGTGGT 

GTGGTGTGATGTGATGTGGTGTGCGGTGCGGTGCG 

GTGCGGTGCGGTGCGGTGTGGTGTGGTATGGGTTG 

AGGCTGGCCTTAGGAGCCTGTTGGCCTTCCAGGCC 

AGTCCTGAAGCCCAGCCCAGAGCACCAGACTCTGC 

AGTCAGTCAGTGGAGGGCCCACATCTCAGCCAATG 

CATGGCTTTGGGTGGTGACTTCATCTCCCCTAGTGT 

TCCTTTCCCCCTCTGCAAAATGGGAATGGGGATGG 

CTCAGAACTCCCAGCGGGAGTTAGGAGGAATAAT 

GTATAGGAAGTATGAGCAGAGTGCCTGG 


SEQ ID NO: 
108 


DG5S13 


169961410- 
169961530 


TGATGTGCTCGTTCCCATAGCCCCGCTGTGTGTGTG 
TGCGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTTTGGTGGGGTGGGAGGGGAGGCAGAAG 
AGGAAGAGAGGGCA 


SEQ ID NO: 
109 


DG5S123 


170015858- 
170015997 


TGGTGATCAGCTCAGTGTCCTTGGAAAAGAGCAGA 
AAGTGGTATCACGAACATATCTTCTCCTTTGCTTCC 
TTCTCCTCACTCTTCATCATCATCATCATCATCATC 
ATCAAATATGGATCTGTGAGGCTACCTCTGGG 


SEQ ID NO: 
110 




17U041996 - 
170042336 


GGAGGAGAGACCAGCATTCACATTCAGTTATTGTT 

GTTTTAAATCCATTACGCACATACATAGGAGAAAA 

TTTCAGCAACAGTCACCCTCTGAACCCAGTTCCTC 

AGTTCTCTCCAGAGGCAACTAAAATGCTCAATTAT 

TAGTGTATCCTTTTGGAAATATTTTATGTATATGAC 

AGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

TTCCTTTCCAATATTAAAATAATATTAACATTGGTA 

ATAGTGGTACTAAACAACTTAGGGTGTm "1" I" 1" 1T1 1 

CATTTAATAGTATATTTTTAGTATCTTTCCAGGAAA 

AGATACATGGATGTGCCACA 


SEQ ID NO: 
111 



2345.2049-005 
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D5S625 


170105556- 
170105787 


TCAATAAATGATTCTGGGGATGTGTCTGTCTGTCC 

ATCTGTCTCTCTCNAGANACANATACACACACACA 

CACACACACACACACACACACACATCCTGTTAGTT 

CTGTTTACCTGGAGAACCTTGACTAACATACCCAT 

TAAAACCAAAATATGTCCTTCAGGGTGTTAATGTT 

TGGTTGAAGAAACACAGAAGTTTAACAATTGTATC 

AGGCTGGGCACGGCCTATAATCCCAGCATTTTGGG 

AGGCCACAATGAGNGGATCACTTGAGCCCAGGAG 

TTCTAGACCAGCCTATGCAACATAGTGAGACAAAA 

AAATGAANAAAATTAGGGGTGTGGTGGAGCGCAC 

CTGTAGTCCTAGCT 


SEQ ID NO: 
112 


DG5S959 


170167429- 
170167616 


GAGTTCTATGGAACAGCATTTATTGAATAATAACA 
TTTCAGGAAAAAATATAAGCTTTACTGTATATTAA 
AATACATATATACGTTTATATATTATATATTATATT 
ATATTATATATTATATATATTATATATATTATAATA 
TTTATATATTATATAGATATAAATCAACTACAAGA 
TCCAGTTCAA 


SEQ ID NO: 
113 



Table 8: The Build 33 location and size of KChlPl exons. 

5 



EXON 


START (NBCI33) 


END (B33) 


Size (bp) 


1a 


169716298 


169716511 


214 


UTR 1 


169848417 


169848523 


107 


UTR2 


169861083 


169861154 


72 


UTR 3 


169864589 


169864679 


91 


UTR 4 


169867066 


169867173 


108 


1b 


169867120 


169867180 


61 


Ins-r 


170075401 


170075433 


33 


2 


170081305 


170081429 


125 


3 


170082868 


170082937 


70 


4 


170084380 


170084450 


71 


5 


170085260 


170085367 


108 


6 


170095347 


170095451 


105 


7 


170096383 


170096445 


63 


8 


170098306 


170099177 


872 



15 



2345.2049-005 
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Table 9. The Build 33 location of SNPs found across KChlPl after the first round of 
sequencing that was limited to the exons and flanking sequences. 



START (B33) 


MARKER 


VARIATION 


169716197 


KCP_ela_249924 


C/G 


169716300 


KCP_ela_250027 


C/T 


169716322 


KCP_ela_250049 


A/C 


169740666 


KNB_24222 


A/G 


169740703 


KNB 24259 


A/G 


169741172 


KNB_24728 


G/T 


169746339 


KNB 29895 


C/T 


169747941 


KNB31497 


A/G 


169751742 


KNB_35298 


A/T 


169751814 


KNB_35370 


C/G 


1 V /AT ■* ft A 

l 6975 l 843 


KNB_35399 


A/G 


169848476 


KCP_UTR1_382206 


C/T 


169848542 


KCP_UTR1_382272 


A/C 


169861338 


KCP_3UTR2_395068 


A/G 


169864750 


KCP_3UTR3_398480 


C/T 


169864875 


KCP_3UTR3_398605 


C/T 


169866182 


KCP_elb_399912 


G/T 


-t *— T A An 1 /-V^% 

I 70081292 


KCP_1152 


C/T 


170081464 


KCP_1324 


G/C 


170081473 


KCP_1333 


A/G 


1 /UUoZ /oV 




C/T 


170085097 


KCP_4957 


C/T 


170085116 


KCP_4976 


C/T 


170085151 


KCP_5011 


A/T 


170085191 


KCP_5051 


C/T 


170085217 


KCP_5077 


A/T 


170085342 


KCP_5202 


A/C 


170095344 


KCPJ5204 


C/T 



2345.2049-005 
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170095540 


KCP_ 


15400 


C/T 


170096292 


KCP_ 


.16152 


A/G 


170098209 


KCP_ 


.18069 


C/T 









Table 10. The DNA sequence of the SNPs identified across KChlPl. 



NAME 


SEQUENCE LISTING 


SEQ ID 
NO. 


KChlPl 


See FIG. 1 


SEQ ID 
NOl 


SG05S872 


TGGCTGTCCCCTCTGCCTGGAGCAGGCTTTGCCCAGATGTC 

CTCCTGGCTCACTCCCTCACCTCCTTATGTCTTGACTCAGAG 

GTCACCCTTCCAGATTAGACTGCCTGACCCCTTCTGTGCTTT 

CTGTTTTCTCCTTATTACAAATGAATCTGCACCATATTTCAC 

TGATTGTGTTTGCTGCATGCATGAGGGCTCACATAAGGATG 

TGCTTTTTGTCCACTTTGTTCATTGCTGAATCACTAGCACTG 

ACAGCTGTACCTGGCACAAACTGGGTGCTTAAGAAATATTC 

TTGAATCAAGGAATCAATAAATGAATGTTATAGAGAAAGC 

AGGAGAATAGATGATAATTGAGAAAACTGAAGCCCAGAGA 

TGGGAAGTCACTGGCCCCATGTCACACAGCAGCAAATGCA 

GAACCGGTCCTGGAACTTCAGCCTCTCAGCCCCGGCCCTGT 

CCTCTCCTGTGCTTCTCACCACTTTATGTAAGTTTTTTCTTTA 

TTTGTGGAGCTCTCAGCAGGCATTTTTCTCTCTGTGCTCAGT 

TGGCATTTTTCCCTTGAACCAGCTGTGTCTTCACTCTCTTCC 

CCATTTTCTCCAGAATATGTTCTTCTGTTTAACTGAATGTTC 

TCTTTTTCTGCAGGTCTGGCCCAACTGCAATATCCAGAGAC 

TTTTCGGTGTCATATGAAAGAAAAGGAGCAGGAAGCCAAG 

ATGCCCCACCTGGCTTCTACATCAGGGTGATCTGCATAGTA 

AGATGCAAAGACACTGACATATGCCTGGGGGTAACGAGGG 

CAGTGGGGGGAGGGAGCTAAGCCAAGATAAGCCTCCTCCC 

CACCAAACATAGGTGCTACTGAGCAATGATAGGGGGCATG 

CTGTCTGCTCTGGTACTTGCGTAGGGAATGCTCTGAGAAAC 

CTCACTAAATCTGCCCTCTAGAGTAGAGCAACCTGGGAGCT 

CAGGCTTCCCTTTCCTCTGTGTGATGGGTTGGCGGTCCTTAG 

AGCCAGCCATTTC[A/G]TCCTGCTCCTTCTCTCCTCCCCTTCC 

TGACCAATAAAGATTGTGTGCTTCTGCCCAGTCAGCAGGGT 

GGGCTCTCACTCCATCCTGCCTCTGGTATGACAGCACAATT 

CCCCTCATTCTTTATAATCATTATAAAATAAAATAACTACCT 

TTTAGAATACTTATTTGATATGAGGCACTTTGCAAACCCAC 

AGTCCTGCATATCCCATTTGACATATCAGGATGCTGGGCTT 

ACAGGTTACCCCAGGGGTGGAGTTGGGCTCAATCCTAGGAT 

TGTCTGCATCTGATTCTGAAGCTTGTTTTCTTTTCCCCTATA 

CACAATCATTCATTCATTCATTCAGTAATTTTTAAATTGAGA 

CATACTATGTACCAGCACCTGTTCTAAGCATTGGATTATGG 

TGATGAATGAGGCAGACAGGGTCCTTCCCACAAATAACTA 

ACTCTATTCAAGCAGTGGGAGAAAAAGCAATGAATGGGAA 

ATAAATGCACAAATCAAGTAATGTTGGATGGGACAACTGCT 

GTGGTCCCATTGAAACAAGCCCAGAGTGAGCCCAGTGTAG 

GGACTTCTTCATTGACTGGTTGGGAATTGAGTGACAATCGG 

TTGCTGCATGCTGATGGGTGCCAAATACAACCGTAAGGAAA 

CACTCCCCTGGGAGGGAGGCGGGATCCAGGTTAGGAAAGA 


SEQ ID 
NO. 114 
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GCCTTGGATTGAGGCAGAGTGTCAGGAAGTGGGGAGGTAC 

GCAGCTGACCTTGGAGAAAATCCCTGAGTGGTGCAGATCTC 

TTGAATCTCTGAGTGGCTCAGAGTCTTCCTGGAAATGCAGA 

AATCCCCATGCCACTTAGGGGCATCTTCATTCATCTCCAGC 

CCTCCTTTATTAAGTCATGTATACCATCTCCTCTCTTATGCT 

TAATGTCATGCCACTCTTCAATCCTTGTCCCTTCTTTCCCTCT 

GTGCCTGCTTGTGGTTTACTCCTGCTGACACCAAAGGCTGA 

GGAGGATGAAAGAACAATTCCAGCCCTGAC 




SG05S873 


GGTAATTCTTAAGCTGGCTGGGCCTAAAACTGCAAACTGGTATTGG 
GCATGCCAGAAGGTAACCATAAATGGGCTATTTGGAGATTTCTAGG 
AAGAAGAATGACATTTTGTTTCATTCCATTCCATTTCATTTCATTC 
CATTAATACTAAAAATATTAACTAAAGCATCATTTCTACTATATAT 
CCAGAAGAGAACATGGTCTTAGGTCTTTTAATAAATGAACTTCAGT 
TGCAAACTTTCTGCTGTGACGTTATATTTCTCTTTCCACCCTAGAC 
CAGCCCCTAATGGGGCCATGAAGTCAGATTTTTGGTTCATGGTGTT 
GTCGGGGCAGCATAGCCCAGAATTCCACTTCCTTCCCTGAGGACAC 
ATTTATTCTGGTAGATGTGCTGTTTTCCATTTAAATGTCCTTTGGC 
AATAAAAGAGCTGGCTCCAACAGCAGACCACGGGGCTGGCTTTGTC 
GGCAGACACCACGTGTTCATGACTGGCAGCTTTGTCTGGAAGAGGG 
AGCTTTTAAAATGCAGTTCTATGCTGACTCTTTGGAGTCTTCCCAG 
GAAGATAACTGCTATTGCATTGCATGCTTAATTTAGAGCACCTATT 
TTTCCCTCTCCTTCAAGGTTTCTGTATATCTTCTCAGTTCATGAAA 
TTAATTATTTGGGTACAATAATTGTACAAAGGCACTTTATCAGACA 
CTTCGTATAAATTATTTCTCATTCTCAAGGCAACTTGGAAAGGTCA 
GTCTAGGGGTCAGCTGCTACTTTTGGTGATCAGGCATCACCCCCTC 
CTTCCTCTTAGTACGTTATGACAGTGGCAAGTGAGCATTACCTGTG 
GACCCCAAAGGAGTTCATTTCCTTAGAGCCAGCCATTCCTCAGTTA 
ATCTGG 

TCTGTCAGACACTCTGTCCCAGGACACTGAGCCTTGAGCAT 

GTGAAGGTGTGGGCTCTGCTGGGGGCTTGGCAGCCAGCACC 

TGTCTGTGTATCACCTGGCTCCTGCAGCGAGAACCTGC[A/G] 

GTGTGATTTCTGCAGCCTGGCCCTCTGAGATTCCATGGCTG 

CTGACCATTTTCCACTTTCCAAGACTGTTCACATTCCCAGCA 

ATTCTGTGAGGCCCTGGCCTTCAAAGGTGTTCAATACATTC 


SEQ ID 
NO. 115 


crrriiirijjri rrrrrrrrrrrTGAGACAGAGTCTCACTCT 

GTCACCCAGCCTGGAGTGCAGTGGTGCCATTTCAGCTCACC 

GCAACCTCCACCTCTCGGGTTCAAGCAATTCTTCTGCCTCTG 

TCTCGCAAGTAGTTGGGATTACAGGCACACATTGCCACTTA 

CGGCTTTTTTATCATTATTATTATTATTTATTTTTAGTAGAG 

ATGCAGTTTTGCCATGTTGGCCAGGCTGGCCTTGAACTCCT 

GGTCTCAAGTGATTCACCCACCTCAGCCTCCCAAAGTGCTG 

GGTTTACAGGCGTGAGCCACTGTGCTGGGCCCCATTTGTTA 

TTTAAGGGAGAGTCCGTTTCTGCTGTTTGTAACTAAGGACC 

TGTCTGATCTCTAGGAATTATTGACCCCAGTTTTCAGATAA 

AGAAGTTAAGCTTGAGGTTAGAGCTTTTGAGCAAAAACTCC 

TCTCCTAGAGAACTCAAGTATCCAGGAATACTCGGTCAAGG 

CTGGGCTGGACCAGGTCTGTAATCCTGATATTCAGAAAAGG 

GATGATTTCTCCTCTTTGGTTTGGTTTTCTCACTGAGGCCTG 

CACACCAGTTTATTTCCTGACTTGTGCATTCAACATGGGCA 

AATCCAGGTCAACAAAGACTGGCAGCTTATTCCTGAGTACA 

GTTCCACCAGGTATGGCACACAAAGTGATATGAGTTAGAAC 

ACAGATGGATATAGATGTTTTACAAATGTAAGTTTGCATAA 

CACACACACACACATTGCTATGTGTTAGAAAAATACAATAA 

GCTCATCTAATTTATTATTTCATGTGTCTTATTGCTCAGAAA 

GAGGAAAAGATTTTATTGAAGTTGAGAAAAGAAATTGAAT 

TAAAATAATA 



2345.2049-005 
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KCP_rs31 
5773 


AGAAACTCCGACTGTCTTTCAGCACACAGAAGACACTGTAC 

TGGACCCGGACATTAGGCAGACACCCACGCCTGACTTTCAG 

GAGAAAAGAGAACATGACTAACGGATATTCTTAGTAGATG 

GTTTATTAGAAAAGAGAACATCTTCCAGCATGTGTCCTGGG 

GTGATGGGTGTGGGAAGCACTCAGTCCATAGTCCTGGTCCC 

TGGCTTCCCCAAGCCCAGCACCATGAATGTACAGTGGAAAG 

CAGAGGGTGCAGCGTCTCAGAAAGATGCTTCCACTCACAA 

GGATTGGAGCTCACAAGTGAGCTCCATAACCTGCAAACCA 

GAGAAACCTGAGACACTGCCCCTTGGCCATTTTATCAACGG 

AGACTTTATTGTGATTATCCCGGCAGGGGGCCGAGCTCTCC 

TCTCTGCAACAGGAAATGCTCTTTAGTGAAAATGCAGCATT 

TCTCCAAGGGTAACAAAGCTGAACGCCTGCTTAGCTTATGA 

ACCCTCAGTTGGCCTAGGTGGTGCAAAGACCCTGCTGTTAC 

TGCTTTGATCATCAGTACTGTGGACTGTACCAGGAGATCCC 

TGGGAATGTGCTCTGGGCGGAAGCAGCTTTTATCTTTGGCC 

CTCACCCATGCTTTATATGGTGAGGTTGGGAAAATGGCACA 

AGGCTTCTCCTGAACCTCAAATCAACACCCTTGCCCCATTT 

AGATCCTATCTGGCTGTTTCTTGCTAATATTACTGCATCACT 

GCACCATCTTTCCTATTTCAGCAAAGTGGAGTCATGTGTGG 

TTTATGGGGTAGATGGACCCCAAAACTGATAATATGAATCA 

AGCTATGGTGTTTACTCCCTAGGAAATGCACAATTTTTCTG 

GAAACCTACAGAAGCTTCAAATGCATTCGCCATGCAAAGCT 

AAGTCAGCAGAACAACCCGTTTGGCTTTGGAGGCTAGTTCA 

GTTCCGCGGACAGGGAGAAAGATGAGGCAGACTGTGGTTT 

TTCAGTTCCTGGAGCTTAC[A/G]GAGCTCCAAAGCTCCCTCT 

CTTCCC A CCCTGGCTGC ACTGTTCTTA A FIT! AG AT AATACC 

CTGCCTTCTCGTATTGCTGCTGAGCTCCTAGCATCCTCAGTT 

TATCTGTCTGTGAAATGAAAAATCTAATGTTAAATTTTTTAC 

CTATGGCATGAGAGAGATGGCTATGGCTCTTGTGAGCCTCT 

CTGCAGCCCCTCTTTTCCTTCAATCACCCTCTGTCTCTCCTG 

CCTTCTGCTTATTCTCTCTCTCCCCTCATCCCCACTTTCCCAG 

TGGGTCCTCTGTTCTC nTTTTTTI TTCTTTTTAA ATCTCTCT 

ATGCCTCCAGCCGAGAAGATAAAGAGTGTACATCTTTCTGG 

TTAAAAAGTTTTGCTTTGCAGAAACACAGCCAATTTATGAT 

TCTGGCCTTCCCAGCTAGGGACAGTGTTCATTTACATTTAG 

GACCATGAGGAGAGAGGCTTAGCTGTGTGTTTCTGAGGCCG 

GAGAAAATTACAGTGATATATAACAGTGCTGCACTCATAGA 

GGTGCTGAGCCGGGGTTGGGCTCAGGCGGCCGCTAAGCTC 

AGAGTGGAAAGTTTCAGAGGGGAGGCAGAAAGGAGAGGTC 

TATAGCTCCTCCAGATTCTAGGTATTAATTTACTAAGATATT 

CCTAAGCCAGAAAACAGAGACAGAAGACAAAGAGAAAGA 

GGGAAGAAGAGCAAGACAGAGAGTTAGAGAGAGACAAAG 

AGAGAGAGTTAGAGACAAAGAGAGAGTGGAGAGGAGAGA 

GAGCAAATATTGAAAGGAAAAGGAAAAAGAAAGAAACCT 

GACAGCTCATGAACTTTTTAAAAAGTTACAAATTAGATTTG 

AAGAGATGGGCAGAGGTTTAAGATTTCTTCATTAGGCTGGG 

TGTGGTGGCTCATGCCTGTAATTGCAGCACTCTGGGAGGCT 

GAGGGTGGCAGATCATCTGAGGTCAGGAGTTCGACACCAG 

ACAGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATAC 


SEQ ID 
NO. 116 


SG05S876 


TATATACAACCTGGAAGCTCTTTTTCCAACCATATCACAGA 

CAAAGAAATTGAGGCTTGTACAGGTGAAGGGGCCTGCCTTT 

CCTTTGCTCACAGGAATGTGAGGATGATACAAAAGTGAAG 

GATATTGGCATTCTTCAGGCAGGGAGATAACCTGGACAGG 

GGTGGTGCAGCAGGCATGTGCATAAAAGGAGCAAGAGAAG 

CCTTCTCTGTCGTGAGCAAGCTTGCAGGCCAGATGGAGAAA 

AATGAAGTAAAGTCACCCCAAAGCCTGGATTCTCATCTGGA 


SEQ ID 
NO. 117 



-132- 





GTGCCTCITGCCTCITGC 

CACTGGGCTGGAATTCCACTAAGAATTGAGTTGATTTCGTC 

ATCTGAGGCCCTGGGCACAATGACAAGGGTGGTTTTCTCGG 

ATCTGCAGTGAGCATTACACCAGAGTGTGGGAAACAGTGC 

CTACTCAGGGACCCCACTCTGGGACCCAGGGCAAACTTGCC 

ATCGTCTCCAGTCAGCTCATTAGCCGCCCAGGACTCTGCCA 

GCCCATCCAGGCAGTGATGTAATTACCAAAATGGAGATGA 

ATATTTAAAGGGACTCTTACTTAACCGATATACTTCCTCTCC 

AAGTTCCCTCCTTCACCGGCTCTGGATGAATTTCTGGAGGG 

ATTGCTCTGACATAGGCCCAGAGCTACCTGTGGTTTGACCT 

CATCATGAGGCCTTTCTTCACCCTTTCTTGGTGGCTTGCCTT 

GAGGGTGTTAGGAGATGGTCCATTGTCTGACTGTGAACAGC 

AGGGCAGCTCTTATATTCTCCATCAATGGATCTCTGGGGAC 

AAGACCCAGATGGGTGGGGGGACAGGGGAAGGAAACATA 

AAAGCCAAAGGGACTGGATACCTGTAACTAATTACCCCTTT 

ACTGTTTCTGTCACCAGACCTTAGTGCCACAAAGGATTGGG 

GGTCATTTGTGACAATGTATGTTGTAAAATGTAAAATGCAA 

GTGACCACAAATCTGAAAGC[A/G]GTATAGAGCTTTGGTTA 

AAATAATGCAGGCTCTCCACTGGCATTATTATTGTTGTTAG 

GAGAGTCTGGTGCTCTGTTCAAGGGCTTTTCTGTGCTATGG 

ATTATCTCTGTTTAGCACAAAATATCTTGTGTCCCTGGAAAC 

CCCTTAGTCCTGAGAAAACCAGGGCAGTTGGTCACCCCCCT 

GTTCAATGCAGGCATCAGTTCCACTAGGTAGGGGGTCTTAG 

CTGCATTTTAAAGATAAGGAAATAAAGACTTAATGGGTTGG 

AATAACTGGGTTATGTGCACATAGCTAAAGAATGGTTACAC 

AAACAACTTCAAGTCAAATATTAGACCTGCGTATTCCTAAA 

ATCCCTATGGCTGTTTGCAATAACTTGAGGCCAGCCTCCCT 

CTCCTCTTTTCTAAGCCCTCTTTACCTTTCTGTGTCCTCTGAT 

GGCTGTTGTTTATCAAGGCAACCATCGTGATTCATACCTCA 

AAGCACGCTTTGAATTCTACTCCTATAGGCTCCAAAACCCT 

TATTATCCAGGTTCAGTATTGCTCTAAACTAGGTGAGGTCC 

TGAACAGACCCAGATTTCAAGCATATTCAGGTGGATTTGTT 

TAACAGAGTGTGGCTACTGGAACATCTGGAGCCCAAAGTA 

CACAGGAGGCAGGAGAGAGCCTACTTTCCTGAAGAGAGGG 

ACGGGCCAACTGTCCGACAATGAGGAGGTGGGCATTCTTTC 

CTTTGTAAAACAAAAAGTATCTGAGACAGGGGTCAGTCAAT 

TCAGAAGCTTATTTTGCCAAACTTATGGACCATAACCCATG 

ACACAGCCTCAAGAGGTCCTGAGAACATGTGCCCGAGGTG 

GCTGGGTTACATCTTGGTTTTACATGTTTGAGGGAGACTGA 

AGACATCAGTCAATACATGTGAGGCATACATTGGTTGGGTC 

CAGAAAGGCGGGACAACTTCAGAGGTGGGGAGTGGCTTTT 

AGGTCATGGGTGGATTCAAAGATTTTCTGGTTGGCAATTGG 




KCP_rs95 
2767 


AAAAGTAGCATCGAGAATCAATTTGCATCTCAGAATTGGGA 

TCCCTGCCCTAATCTCTCTACTTTATGCGGCCGTGTCCTGCT 

TTTCATGACTCTAGAAAGCAGAGGAGAAAGTGGATGTAAG 

ATATAAATTAGTCTGTCTTGTAGGGCTTTCTCTTGGTCCCAT 

TCTGGGACCAGCCAGTGTCCATACCTGTGGCCTTTGGTATC 

CAATTTAAGGCAGTTCTTCTCTTTCCATGATCACACAGTAA 

AGGAGCCCCCGTATACAGTGCTCCAGGACTGAGTCCAGTTT 

TTAGTGTAGCGTGCAACAAGAGCAGAAAAGGCAGAGTTGG 

GAAGGACATGTCAACGGGCAGCAATGAGGTGGTATAAAGA 

CCCTGGGCATTTGGAGGCAACAGAGGGAGAAAGGTCTGCT 

TCAAGGACCAACTTGGTCTCTTCCTATCTCTGCCCTGGCAGC 

ACCAGCAGCTGCACATTGGCCCTTCTTACCACTTCCATGGC 

AAAACCAAG[G/T]TTTCTCTACCTCGCCTAGCCGGCCCCTGC 

AGACTTGCTGACACAGCTGAGTGCGGAGTGCATCTAGACCC 


SEQ ID 
NO. 118 



-133- 





CAACATGAGGCGCCCTTCTCTCAAAACAAATGAGCCTTCGA 

AACTCCAGCAAACAGTGCTAATGAATTGCCCTCGGCTTCTT 

AGGCATCATTTTCTCGTAATTATAATGGGAAGAAGACATGG 

AGTCCCACTGAGAACGTGGAGCTAGCCTGCCCCTAGAGCA 

AGGCAAAATCCCTCTCTGAGGACCACACTCAAGCAGAACT 

GATTTTTCTAAGACTTAGAGAAGAAACAAAATCTGATTTAA 

TTCTTAGGAAATTGCTTTTTTTAACCCACCTGTGTAAGCCTG 

TATTTAAATGCTAATATATTTGGCCTGCCGGGATGCCACAT 

TTATTTTCTTCCTTAGCAGCAACAAAAATCATTTATTTATGA 

GAATTCTAGCTCCTACCTGCTCTCCTGAGTTCCTCATCTTCA 

TTTCCATCTACCAGCTGGA 




KNB_2422 
2 


GAGGGGTTTTTAAGATTGTGTGTTCTGAATGGCCTGTCTCTGACTG 
GAACCCAACTCCGTCCCCAGACCCACTTCCATCTTTTTCTGTGAGG 
GGGACACACTCTTTCAACTTTTCCAAAATGGCATCTACCATGGCTT 
TTCTGATTAAAAGCAAACGAAACACACCCTTCCTATAATCAAAAAT 
TTAGAAAAGCAGCAAAAATAAAAAGGGGATAAGGAAGAAAACAGAA 
ATTAACCACCATCCC [A/G] CCGCTAAAATTTTGATGAGTTCTCAT 
GTGTTTCCTTNCAGCTGATTGTTGTTTGGCATACATTTATTAA 


SEQ ID 
NO. 119 


KNB_2425 
9 


CACTTCCATCTTTTTCTGTGAGGGGGACACACTCTTTCAACTTTTC 

CAAAATGGCATCTACCATGGCTTTTCTGATTAAAAGCAAACGAAAC 

ACACCCTTCCTATAATCAAAAATTTAGAAAAGCAGCAAAAATAAAA 

AGGGGATAAGGAAGAAAACAGAAATTAACCACCATCCCNCCGCTAA 

AATTTTGATGAGTTCTCATGTGTTTCCTT [A/G] CAGCTGATTGTT 

GTTTGGCATACATTTATTAATATTGGAATTAAAAATATATATGGCA 

CTTTATATCCTAGAAAATAGTAATACTGTAAATGTGTTCTAGAAAT 

GGGAGCTGCTGTTGCTCTTATTAGAGAATTCAAACAAAGAAGGGAG 

GCTCGCTGGGGACAGCTTCTGGGGGAGGATGGGTACCGCTTTGAGA v 

CA 


SEQ ID 
NO. 120 


KNB 2472 
8 


AGGTATGAGTCAGTTGAGTGGGGACAGGTAATAGAGAGCTAGAACT 
GGCTGGCCTTATGGCCTCCAAGGCATTGGGGAGCCACTGTACATTC 
TTGAGCAGGCAATGACTTCACAAAAGGATTTCTCAAAGGTTAGTCC 
TGCAACAGAAGACAGCGTGGATTGGACTGGAAGAGTGGGAGGGCAG 
GTGGAGAAGGCATTG [G/T] CTGCAAGTGGGGAGCAGCCCTGGGGG 
CCCAGCCAGTCCCCTGTGCCCTGACAAGTGGTATGGCATGGATGGA 
TGGCTCTACTTCTGGGCCGCCAGGATGGACAGGTACTGGTTGCTCT 
TCACCATGGCGATAATGAGGAGGCCACCGGTCAGCAGGAAGGTGGG 
CCAGAAGAGGGAGAAGAGGAGGGCCTGGGGCCCGTAGAGGCGCTGG 
AAT 


SEQ ID 
NO. 121 


KNB_2989 
5 


CCCCATCCCTCCAGTTCAGTACCTGCTGGTTCTGGTCCCGAGTGTC 
CTCCGTGTGGTACAGCACAGCCCACCTGCCGGCAGCTGACACGTTG 
ACCCACAGGCATGGGTACTGGGGCACCTTCTTGCCCTTCAGCT [ C / 
T] CTCCTGGTCCCTGATGTTGGTCTCAATCAGGTGGCACTTGGATT 
C CTGGGTC C ACACGCTGAGGAGAC CACACACATGCACACATACAC A 
TCTCAGAACTGGGTGACACACAGAACACCCATTTGAACCCATTATC 
CCCTGGGAGCCTCTAGAGGGATCCAGGACTGGGCTCCTCATCTTGT 
CTTCAGCATCCAGCAATAAAGGCACAT 


SEQ ID 
NO. 122 


KNB_314 9 
7 


TTCCCTGCACTTGAACCCTAGAACCTAAGAATGAGCATCGTCTTGA 
CCCTGCTGCCTTGAATGAGGGTCAAGGAGAGGGGTGAGTAGAAGGC 
CAGGGTTCCTTACAGATGCCAGACCCTTAGGAGAGGGTTGGGGGGT 
GGGCAGGCCNGGAGAGCTCAGTACCTTTTCTGGTAGAGGGGCAGCA 
CAGTCGTGACCAGGATGTAGTAGGTGATGACGGCACACACCACCAT 
GGTTACACCCAG [A/G] CAAAGGGCTCGTGTCTCTCCCCGCTTCTG 
GGCCATCACCAGCTTCTTCACCATATTCACTGGGGGCAGTGATCAT 
TTCTAGGTCCACAGAAGCAAACAGAAGTGAGATCAGCCCAGTTCAC 
AGGTGATCCACAGAAAGAGAGGACAGGTGAGAGGGGAAGGTACTCA 
ACTATTAATATCACTCTTGTTTATATTTGGAGCTTTGCAACTTCCA 
GAAGTCTTGCTTTTTGGACCCCATGTA 


SEQ ID 

NO. 123 ! 



-134- 



KNB_3529 
8 


AGAGGAAGGGAGTCCTCCTGCCTGCCTCCCTCCCTGCCCCGTGGCA 
GGCTGCTTTCCCC [A/T] GTCTCCCTCCAGCCCGGTCTTCAGAGAA 
ATCACTTCCCAAGTGCTTTCAGGCCCGGTACTCACAGTCTTCCCGG 
CGTCCTGTGGGTCTTGAGCAGCAGACAGTTTCTTTCTGCCTGGACC 
C 


SEQ ID 
NO. 124 


KNB 3537 
0 


AGCCCGGTCTTCAGAGAAATCACTTCCCAAGTGCTTTCAGGCCCGG 
TACTCACAGTCTTCC [C/G] GGCGTCCTGTGGGTCTTGAGCAGCAG 
ACAGTTTCTTTCTGCCTGGACCCCCGCCCCCACCCCAAAAGAGGCC 
ACAGAGCTTCA 


SEQ ID 
NO. 125 


KNB 3539 
9 


AGCCCGGTCTTCAGAGAAATCACTTCCCAAGTGCTTTCAGGCCCGG 
TACTCACAGTCTTCCCGGCGTCCTGTGGGTCTTGAGCAGCAGAC [A 
/G] GTTTCTTTCTGCCTGGACCCCCGCCCCCACCCCAAAAGAGGCC 
ACAGAGCTTCA 


SEQ ID 
NO. 126 


KCP_rs31 
4129 


CTTATCTCCACCCTTCACTTGACCCAAGAATCAAAGAACCT 

GAAACTGAGACTTGGAGGCTTGAAGTCACTGGTGCAACCCT 

AGGGGCCAGAACTAGATTCGAAGCTGGCCCTTCCAGATGG 

CACAGCTTGGTCTGTCTCTGATGACCCTGGGGCTGCTCTGA 

GACATTAAAAATCACCTCGATCATACAGTAAGCTGCCACCT 

GAGGCTCTGGAGGTCACCCTGAGTTTCCCCAGCCCCCAGGG 

AGGTGGGTGCAGCCTGGCCTTCCCTGCTGAGCGAGCTCACC 

ACCTTCCTCCCTCCTGCCTCCAGCAGGCGCGAAATGAAGGC 

AGCCACTCAGGCCTCCCTGACACACTCTCAGGCGGTGAGTG 

CCCTTCTCCACCCCTTTCCTAATTGAATCTTATTAACAGGAG 

ACTACAGTGTCTGTTTAATGGGCACCATAGCACCAGAGGGT 

CTAAGACCAGCTTCAGACCTTGCAGGCAGATTGACAGAGG 

GATGTAGGA[C/T]CTGGAATTCAATCTCAGAAGAGCAATTTT 

CCAAGGATGATCCTCTGTCCACTCAGAAGCAGGAAAAGTCC 

TCCTGGGGCTAATCCAGAAATGCCAGGCCCCCCTCCTGCTT 

CCCTGGGGGAGAGATACACAGTGCAACAGGCTGCCATTTAT 

GAGTATAACCGAAGGGCTCCTTGCTCGTGATACTCTGAATA 

AGTTATTAAGGGCTACATATTATTTGGAAATCATAAACAAA 

CTTTAGCATTCTTCCCAAGGGAAGGTGGGAACAAACAGGG 

AAGGGGGGCCGTGGGGTCTTCTGCTCCCCCTAAATGAGCCA 

CAACCAAAAGGCATTGACAAGCCCTGTCCTCGAGGGTTTGT 

GGGTGAAAACCCAGGTCCTTTGCTGGCTGCGGGGTTGTGTG 

TGACAGATGGCTACAGGTGGAGGGCAAGAAAATAACAATG 

CTGCAACAATAAATATTGACGGTTTGCATTAGTACGGGGTG 

TCAGAGATCACAAAATATCTTC 


SEQ ID 
NO. 127 


KCP rsl8 
3398 


TACTTTTCAGCCTGAGGTCTCCTCCTCACCACTAACACCCTT 

CCCTCCAATCAAACTGATCCATTGTACCTACAAAAAGCCCG 

TCCCACCTCCTAGCCTTTGTTCACAGTGGGTTCTCTGCTGGA 

TCACCATCCCTCCACATTTCCAGGTGTCCCTCAAGACTACTC 

AGCAGCAGCTATCCATACAAGTTCCTCAACCCTGGCTTTCT 

TGCCCTCAAGTAACCAGTTCATCCTCCCCAGTCATATAGCC 

CTCTATTTACATTTCTTTTCTGGAAGCTATCATTTTTCACGT 

GCCATTTGAGTGAGTGTCCTCGCTAAGACGATATTTTCTTTG 

AGGGCAGTAACCTTTCTTATATGTCTCTGTATCCCATGAACT 

TAGCAAAAAACAAGGGACAGAACAGGTGCAAAGTCTACGT 

GGTTAGTGAATTTAACAGATCTTCCTAACGTGTAAACGTCG 

TTGTCCAGGTGAATGGAAGAAGTGAGCTGAGATAGAGGGG 

ACAGACAGAGTCAGTGTCCAGTGCTGACCTCTGAAATGGA 

AAAACATGGCCAGTCCTTAGGAGGCTGCAGAGGCCAAGAC 

CCCAGTGAGGTTTGGGGGTTCCACAGCAGAGGAGGAGCTG 

TGGACCACAGCAGGACCCCGATGCCATCAGCAGGGGAGGA 

AGTAATCAGAGAGGTGGAGGAAGGAAGCCAAGGGAAGTC 

AAGTAAACACCAAATATTCCCTCCCGGTCCAATGCTGTGAC 


SEQ ID 
NO. 128 



-135- 





CTGCATAAGCCACCACTCCCCCAGTCTAGACTCTACCCATG 

GAAGAAGGAAGAAGATAGAACTCTGGATTTGAATATAATT 

CTAAAATAACCAAATTTATCTGAAAATGACTAGGCTGAGTT 

TTCTGCTTCAACCAGAAATGGAGCTTGGAGTCAGAAATTAT 

GTGAAATTATAGAAGAGAAAGTCACCATCTTCCATCTCTGA 

GTCGTATGATCATTTTAGACATAAAATTGTGCACTTACGAT 

GTACCAAGTGCTTAATATA[C/T]GTGATCTCATTTCACCAGG 

GAAACTGTATAATTCATTGCTTTAACTGACAAAATTCTGCA 

ACTGAAGAAGGTGCTGTTAATAATTGCATTGGGACGCAGGC 

CTGAGCAGGCCATGATTTGTGGCTGTCCTACATCTGACCCT 

CACAGTATCCATGGGAGAAGGCAGCATGTTTATGCCCCCTG 

ACAGCTGGGGAAACCAACACTTAAAGTGATTAAGTCACAA 

GTCCAAAATAAATGACAGAGCTGCAGTTCAAGCCCAGGTG 

GTCATTTACCAAAGGCCATGCTCTTTTCACTTTGCATGGGAC 

TGTGACCGCTGGCTCTACCCAGCTTCCCAGTGCGACCCTTC 

CCCGCCCACTGTTTCTCTTCTCTGGCCAACGGAAACACAAT 

GAGACCACATATGTAACATTACATTTTTTCATAGCCACATT 

GAAAAGAAAAAGGAACCAGGTAAAATCCATTTTAATATGA 

TATTTTATTTAACCCAATACAGTTGAGGCTTGAACAACACA 

GGTTTGAACTGTGTGGGTCCGCTTACACATGGCTTTTGTTCA 

GTCTCTGCCACCCCTGAGACAGCAGGGCCAGCCCCTCCTCT 

TCCGCCTCCTCCTCAGCCCACTCTACATGAAAACAAAGAGG 

ATGATGATCTTTTTGATGATCCACTTTCACTTAATAAATAGC 

AAATATATGTTCTCTTCTTTATGATTTCTCGTAATAACATTT 

TCTTTTCTCTAGCCTCATTTACTGTAAGAATACAGCATATTC 

CCAGCTACTCAGGAAGCTGAGGCAGGAGAATCACTTGAAC 

CTGGGAGGCCGAGTTTGCAGTGAGCCAAAATCGCACCATTG 

CACTCCAGCCTGGGCAACAAGAGCGAAACTCCAACTCAAA 

AAAAAAAAAAATAAAAAGAATACAGCGTATAATGCATGTA 

ACATATAAAATATGTGTTAATCAACTGTCTATGTTATGGGT 

AAGTCTTCCAGTCAACAGCAGGCTATTAGGAGTTAAGTT 




rsl03285 
6 


CGCTCAGCAGCCATTAAAAGGATATCATCCAGTCACTTAGTTTCTC 
AATTTAACTTTAAAGGAAAGTTGCCTTATTAGAGAAGTGGCCTCTA 
TTTCAATGTAATGGTCTTTGTCACATCTTCCAATGTGCTGGCTTAG 
TGCTGAAGGATGGGGAAAGGCAGTTTTCACATATTGCAGCCACCAT 
ACCACCAAAGAAAACAGGTGCACTTCCAGGCATCATTTAGCGGGGT 
ACCA [C/G] ATTCCTGGTTCCAGTTTCCTTTTTAGAAAATCTGAAA 
GTAACTTTGGGGCATATCTTTTAAGGAGTACTCCAACACGACTAGT 
GGACAGACCCTAAATTAATTGCCAATCAGCTCTGCCTTCTGGTATT 
TACACCTTTATGTAATAACCTCCACTTGAAGGTAGATGAGATCTGT 
GACTTGCTTCTAACCAGTGGAATATGGCGGAGGTGGTGGGACGTTA 
CTCCTGTGATTACATTACATCATGTGGCTCCTTTATGATGGAAGAT 
TCATGCTAGAGATTCTCCTTGCTGACTTGACAAAGTATGTAACCAT 
GATGAAGACTTCCACGTGGCAAGGAGCTGTGGGAAGCCCAGGTGCT 
GAGACTGGCATCCAGCAAACACCCAGCAAGAAACAGACGTCCTTGG 
TTCTACACATACAGGAAATGAATTCTGCCAACATCCTGAGTAAGGC 
TGGAACTAGATTCTCCCCAAGTTGAGCCTGACAAGTAAAATACAGA 
CCAGCCAACACCTTGATTGCAGTCTTGTGAGACCTGGGGAAAAGGA 
CACAGCTGAACCGTGTCCATTCTTCTGACCCACAGAAACTGTCACA 
TCATAAAGGTATGTTAGTTGTTACACAGTTTAGAAAACTATTACAG 
CTGCTCAAGAAGGTTAGCTAGCTCCAGATTTCAATCCATTCACAGG 
AAAGCAAGCTTTATTCCTAGAAGAATAATTCATGCTTTGCAAAAAG 
AGGAAAACGTCCTGCAGTTTTAGAAGGTCTTTTCTTTCTCAACACA 
CCCAAATTTCTTTAAAATCCTCAAGAAGTGCATTTGTTTTCATGGT 
TGACTCGAAGAAGTGAGTATAATTAACTCACAAAAGGTGGGAGGAA 
GGGACAAATTAAATTTTGGT 


SEQ ID 
NO. 129 



-136- 









KCP_rs88 
8934 


CACTCAAAGGGCTGGGGACCCTTGTCCCTCCCATGTGCATC 

CATCTCTCCTATCTCTGAGTCCCCAGTGAACTGCTGCCTCCC 

TAGAGAAACAGTGCTAGAAGTCAGTGGCAAGAGCAGCAGG 

AGGACTTGGAGCTACATGCAGAGTGTGAGCTCCGGAGTCA 

GACCAGCTGAGTTCAAGGCCAGCTCCACCATCTATTCACTG 

TGACTTCAGGAAGGTTGCTTAACCTCTCTGTGCCTTAGCTGC 

CTCATCTATAAAACAGGAAACAATGAGAGTCTTTCCTTATG 

GGGCTATTGAAATGATTAAGTGAGATCAGGCATGTGATGGC 

ACACAGTAAGAACTCCATAAACAGAGGTCACCACTGCTAA 

TGCAATTATTCTATCACCTCAGGAGACTAAAGCAGGGGAGG 

AAACACCATTGACTCCTGGACATTTACCCAAGGAGATTATG 

GATCCATGTTTTGCACACACTTTAGAAAGACAAGGAATTCT 

AACCACAGC[A/G]TCTGTCTCCACTGCCCCCGTCATTTCAGT 

CTCACCCGTCCACCCTCAACCTCACCACTGTGGCCCGGAAA 

TGCGGTTGCCCAGGGCCACTCTCACCCCACCTCAGCCCTGC 

TCTGCTCAAGTCTCACTTCCACTCCTTCCAGCTCCCATCCCT 

TTCTACCCAGCTCCACCCTGATTTCTCCACCATGACCTTTAC 

CCTCCTAGTCTGATCTAGACCCCTGATCTTGCCGAGTATCTA 

GGACTTTGGTGCCTTTGACCCTCAGCAGCAGAGGTAGAGAG 

GGATCTCGGTGAAGTCTGGGATGTTATAGTGACTTGTTTAT 

CTAAGTGCCCTGAGACTGTGAGTTCCCTAATGCAGGGAGCA 

TCAACCTCTGCAGAGAGCCCCAGAGCCCTGCTCAGGTGTGA 

TGAACAGGAGGCACTCACTTGATGCCCTCACAAAGTTGTGA 

GTGAATGAATGAATGAGTGAATGAATGATTGAATGAAGAT 

TAGTGATTATGTTAATGA 


SEQID 
NO. 130 


rs905823 


GGTGGGGGGGGAGAGGGGAGGGGAGGAGAGGGGAGGGGA 

GTGGGGGAGAAGGGGAGAAAAGCGCAGCTGGCTTCCTCAC 

TCTCCTTTCCTTCCTCACCATCCTTACCCTGGCCCAGGGCAG 

GAGGAGGATTGGCAGAGTAGAGGCAGGGTCTTCTGTCTTA 

GCTGGGCCTGTTGGTGACTTTCTGTTGGCCAACATGGGCTG 

ACTGGAATGTTCTCCAGCATGGCACATGGTCATCCAGATGC 

AGGCTCTTCCCTGGGGCACTATAGCAGAGAGGGCTCTCTTC 

CAGTCTATTGCAGATGGATGCCCTCGTGAGCTGAGTTTTGA 

TGAACATCCCATGTCCCCAGCCACCCCATTCAGAGCCTCTT 

TCTACTCTGGTCCTCTGGTCCCAGCAGCAGCCCTCTGGGTA 

CTGAGGGGAGGGCATCTCACCCAAGCCCCTTAAACCTGCTC 

ACCTTCTTCAGAGCCCACGTGGCCGCAGGAAAGTCACAAAC 

CCTTGTGCTCCCACAGGGCACACGTGTGCACACGTGTGCAG 

CTACCTTCTCTCTAGTTGGTACCTGAGGCTGCCTCCTGGATT 

TTCCAGTCTCTGTGTTCCCAGACA[A/C]CCCCAAGCCCCAAG 

AATACAAGAGCTCTGTCACCAAGCATCGGGCCTGTGGCTGC 

ACTACACGTCTGCAGCTCAGGACCCCTGGCTGCGGCGTAAG 

CTACCAGCATCCCCTTCTCATGGGCACCCTCATCTCCGGCTC 

CCCATCGCTGGGCTGTGACCTGCGGGGGCGCCCCTCTATGG 

AAGGGAAGGAGAAAAATTCACAGTGCTATCTACTCCTCTGA 

ATGCACTCCCACCAATTTCCTTGGAAATTTCTAGCTTTCACT 

GACATATCTGGGATGGGGCGGTGGTCACAAAATCA 


SEQID 
NO. 131 


rs883849 


CTGGCTGGGGGACCATGGGTCAGGGCTGCCACCCCCTGGCTCTGTG 
CCTTCACCTGTGTAACGAATGGGGCACTCACAGCCCCTCTCAAGTG 
GTCCTGGGGATGAAGTGAGAAGGTGACATATACAAGTGAGTTATAC 
ACGTTCCTGTTCTGTCACTCACCAGTGCTCACTGGGTGGGTCACTG 
AACTCCCCTCAGCGTTTCCTTCTCCATCTGTAAACCACCAGTGCAA 
ACCTTTCCCAGATAGTGCTGACCCGAAGCAGGAACCAGTGCCCCTC 
TGCCCTCAGTAAGTCTGCCAGCAG [A/G] GGAAGCCCATAGAGGGT 
CTTGGGAAATGAAGCCAACAGAGTCAAGAGGGTCAGATGATGAGGG 


SEQID 
NO. 132 



-137- 





ACTTCAAGTGCCACCTTCATCCCATTCTTTCTGCAAATATTCACCA 
CACACCTACGTGACCTCAGGCTCTGTGTCAGGTCCTGGGGATGTAA 
TGGTGTCCATGAAGAAACAAGGTCCCTGCCCTCATAGAGTGGCCTG 
ACATATGCCCGAGGCAGTCAGCAGCCGAGTGCGGGAGACTCTTGAG 
CAGAGATTGAGTGTGTTGATATCTGTAGGCATCAGCCTGGCTTTGC 
TGAGTGAGC TATATCAGAGTGGAGGAAGCCAGAGGCAAAGTC CAGA 
CTCCACTGATCCTGGATTGAGGGGAGAAGGGGCTTGGCGGAAGAGC 
AGCCTGAGCACCTGCATCTCACTCCAACTGGTGCTGATTTGTTCCC 
AT 




rs213504 
6 


TCCACAGGTTTGATTATAAATGTGTGTATTGAATTGGAATTT 

CTGTTGAAATTCTGATCCCTTCTAGACAAAGAAGGTAAAAA 

TTGAAACATGTCAATGGATATCTAAATATCATTACTCACTG 

GCTITATTTGCAAATGGCTTTCCATTGACAACAGTTACATTT 

TGTTCAAAGCAACAAATGATTGGCGCTGACAATCCACAGG 

AACATGGTGCAGTCATTAATGAATGTGCTCATTATTCCTCC 

CTGCCGGGAGGCATCGACTCCCGTTCTCCAGCCTGTTTTAA 

GCAGACAGACCTACATCTGCACCTGTCAGCTTGGAACCCTA 

GTAGGGGAGGGGGATGCTGATGTGATGGAGAATGAAGAAT 

GGGCCCTGCAGGCTGACATTTTGGGAGAGTAGGTTCTGAAA 

TTTATCCCAAAGGACATGGAATCCTGGAAGCAGGGTTCAAG 

ATCCTCCCAAAATTGATCTCCCAGGATGCTTGGAATGATTG 

TTC[C/T]GAGGGTTTTGTAAAATGCCAGGGGAAAACCAGGA 

AGCTTCTCTCCAGTTGTCTTGCCTCCTTCCTCTCCAGTCTCC 

ATGGAGCTGACTTTGAGAATTAACTCCTGAGGGACAGAGA 

CCCTGGGATGGAGAGCCAGCCCTGCTGGATTCCACAAGGTG 

CTGCTTAAAGCACAACACCTCTTCCCAATGACAGGTTCTGA 

AAGAAGGCCTTGTAGCTAGATGCACAGAGGGTTTTGTTTTG 

1 1 1 ITriTlTITl A ACCTTTC AGCATCTGTCTAAAATTGCTCT 

GGGCTGGGTACAGTGGCTCCCACCTGTAATCCCAACACTTT 

GAGAGCTGAGGCAGGAGGATCGCTTGAGCCCAGGCGTTCT 

AGACCAGCCTGGGCAATATAGTGAGATCTCTATGTCTAG 


SEQ ID 
NO. 133 


rs50057 


GGATCTGTGCCTGAAGCTGAGCTGCTGCAATGAAACTGACATTTCT 
GCCTTGCAGCCTGGCCATGGGCTTAGCTGGACTAAAATGCTGCTGC 
AGTGGTGAGGGCACGTGAGAGTCCCTAATGTACATGGCCTTGCTCC 
TTGTCCTGACACATCTTTTAGGGCTGCTGCTTTCTCTAGTGCTGGA 
ATCTAGATAATTCCTTTCCCAGCCGTTTGTTTCTTCAATCTTGGAA 
AATATCTGGATGAATGTAACACTGTCACACAC [A/G] AACAGAATT 
ATGACTTACGTCACATTCTATGTCGTGATTTTGTGGACTTTTAATA 
ATTGCATTACATTTGTGACCATTAATTTCCACCATCGCCCTGCTCC 
TGAGAATCTGTAAGGGACATTTGACACTCCTCTCCCCACCCACCTC 
AACATTTGTGCTGACCTGAAGGTCACATTAAAAA 


SEQ ID 
NO. 134 


KCP_4976 


GCAACTTTCCCTGGCCTGCAGGAACTCAGGGACTCAGGGGA 

CTAATAACAACAGTGTATGAGCTTCCGGGCACACTGCTTCC 

CAGTGGCAGCCCCTGTACTTAGGGCTTTGTATGTATTAATTC 

ATTTACTCCAATTCCCACAATAACCCTATAGGGTAGGGTTT 

TATTATTGATTACCTTTTTACAGAAGAGGAGAGTAAGGCAA 

AGAGAGATAGAGTAGTTTTCCCAAGGTCAAAGAGCACATA 

AATGATAAAGGATGGATTTGAATGTAGGCAGAATGACCCT 

CAATACAGACTGTTCCTACAGTCCACGTCCTCAGCCACTAG 

ACCATACGGCCACTGGGATGATAGACAGACCACTGCAGCC 

ATGGATAAGGCAAAAACAGGGCTGGCTGTGTTGATCTGTGT 

CTCTCAGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCAAA 

AAAAAACAAAAAAATGGGGCAGGGTAGGGAACTGAAGGC 

AGGAGCTCTTCA[C/T]AGAGCATAGCCACATCCTCCAGGCA 

GACAAGAGGACGCAGGAGGCACCATTCTGTGAGAGTATCA 

CAGTCTGACCCAAAGACACAGCTTCACACTGTCTGATGGCT 


SEQ ID 

NO. 135 



-138- 





TGATGGTTAATGTCACTCTGCCTTTTCCCCTTCTCAGGACTT 

TGTAACCGCTCTGTCGATTTTATTGAGAGGAACTGTCCACG 

AGAAACTAAGGTGGACATTTAATTTGTATGACATCAACAAG 

GACGGATACATAAACAAAGAGGTAAGTGAGCTGGGGCCAG 

GGGTGTGAGAGGGCTCCAGTGAAGGTAACTAACCCAACAG 

AAAACAGCCCCAGGCATGAGGATAGCACTGTCTGAATGAG 

GCAGGCTCTGCTTTGGGGCTAACAGAGCTGGTCCCTGGCAA 

AATAAAGAAGGCCTCCCTCATTGCCCTACCCTGCCCTGTTC 

CCAAGCGCCCAGAAAGGATTAAACAGATTCATTCTCACTGG 

GTCACCTAGATTCAGTAGATATTACAC 




KCP_5077 


TAGGGCITTGTATGTATTAATTCATTTACTCCAATTCCCACA 

ATAACCCTATAGGGTAGGGTTTTATTATTGATTACCTTTTTA 

CAGAAGAGGAGAGTAAGGC AAA GAGA GAT AGAGTAGTTTT 

CCCAAGGTCAAAGAGCACATAAATGATAAAGGATGGATTT 

GAATGTAGGCAGAATGACCCTCAATACAGACTGTTCCTACA 

GTCCACGTCCTCAGCCACTAGACCATACGGCCACTGGGATG 

ATAGACAGACCACTGCAGCCATGGATAAGGCAAAAACAGG 

GCTGGCTGTGTTGATCTGTGTCTCTCAGAGCTCCATTCTTCC 

TCAAGGGGGCACCTTGCAAAAAAAAACAAAAAAATGGGGC 

AGGGTAGGGAACTGAAGGCAGGAGCTCTTCACAGAGCATA 

GCCACATCCTCCAGGCAGACAAGAGGACGCAGGAGGCACC 

ATTCTGTGAGAGTATCACAGTCTGACCCAAAGACACAGCTT 

CACACTGTCTG[A/T]TGGCTTGATGGTTAATGTCACTCTGCC 

TTTTCCCCTTCTCAGGACTTTGTAACCGCTCTGTCGATTTTA 

TTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTTA 

ATTTGTATGACATCAACAAGGACGGATACATAAACAAAGA 

GGTAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGCTCCAGT 

GAAGGTAACTAACCCAACAGAAAACAGCCCCAGGCATGAG 

GATAGCACTGTCTGAATGAGGCAGGCTCTGCTTTGGGGCTA 

ACAGAGCTGGTCCCTGGCAAAATAAAGAAGGCCTCCCTCAT 

TGCCCTACCCTGCCCTGTTCCCAAGCGCCCAGAAAGGATTA 

AACAGATTCATTCTCACTGGGTCACCTAGATTCAGTAGATA 

TTACACAGTGGATAAAAATGACTTGTTTCAGTGTGAAGAGT 

TACTCTTCCCTAGGGAACCTGCATTTGGGAAGGTTAGGAGC 

CACAAGTCAAAGCTAAAAGTTGAAA 


SEQID 
NO. 136 


KCP 2410 
99 


TTGAAAGAGAGCGCTTTGGGGGGTTTTCTTACTGTATGTCT 

CTATTGCATGTTCTGTATTTTACATTTTTCTATTATTTCTTCT 

CTGAGGTATAGTATTGAATGTAGAAAAATCCTCAAATGTTC 

GGTATTAAGCAATACACTTCTAATTCATGGTTCAGAGAAGA 

AAATATCTCGAATAAAAATAAAATAAAAATATGACTTATCA 

AAATTTGTAGGATCTAAAGCAGTATTCCAGGAATGCAAGGT 

TGGTTTAACATTCAATAATTGGTCAGTGTAATTAATCACATT 

AATAGAATAAAAAGAGAAAAAATATAATCATTTCAGTGGA 

TGTAATTGTTCAGAGCTTCTTAAAAGAAGCAACTCACTATT 

TTACTAGATGATTTGTTTCITCTGAATTCCTCTTTAAGGCTA 

CAGGTGGTGCTTCTTACTTTGAACTGATCACTTTCTAGGTCC 

CCACCCTTACTrCTTGTTTTTCATACCCTTGTAGAGTTTTCTC 

CA[C/T]ATAGGAAACCCATGCTTGACATTTGCTCACCAGAGT 

TACAGAGCTCTCAGGGAGGAGACTCAGAGTTCTAACCCTCT 

TGCCCTCC l 1 1 1 1 1CCCAGGACGACAACATCATGAGGTCTCT 

CCAGCTGTTTCAAAATGTCATGTAACTGGTGACACTCAGCC 

ATTCAGCTCTCAGAGACATTGTACTAAACAACCACCTTAAC 

ACCCTGATCTGCCCTTGTTCTGATTTTACACACCAACTCTTG 

GGACAGAAACACCTTTTACACTTTGGAAGAATTCTCTGCTG 

AAGACTTTCTATGGAACCCAGCATCATGTGGCTCAGTCTCT 

GATTGCCAACTCITCCTCTTTCTTCTTCTTGAGAGAGACAAG 


SEQID 

NO. 137 | 



-139- 





ATGAAATTTGAGTTTGTTTTGGAAGCATGCTCATCTCCTCAC 
ACTGCTGCCCTATGGAAGGTCCCTCTGCTTAAGCTTAAACA 
GTAGTGCACAAAATATGCTGCTTACGTGCCCCCAGCCCACT 
GCCTCCAAG 




rsl89530 
1 


TTTTTTTTCCCCAATCATGCTGTATTCTTAGCGTAATTTTAAAATA 
CTTAAAACAAGATCATGAGAAAATAAATGCCCAGATTCTAGCACCA 
AAATTCAGAAGGGGGGGCTATGAGAATGAGGGGCGGGGAGAAGCCT 
TCCTGAGAGTTTCTAAGAGGCATGGAGGCAGTGGGGATAGTGATTA 
GCTCTGGGGGAAGAAGAGGCTACTGGCTGGAAAAGGGCATGAGGTA 
GGGTTGGTAATCACCTA [C/T] TGTTTTATCTGAGTGCTGGTCACA 
CAGATGTGTTCACTTTAGGAAAATGTATTGAGATTACACTTGTGAT 
TTCTGCATTTTTACATACGCACATTAACTcagt catatgctgataa 
atgtttaacaatgggtttgctggagaaaaaagggtcccccggattt 
gtaatgtctgcccatttccgtggtgtaaatactcccttcacaactg 
atttcaagcttcccatgcactgtaactgaagacagagttgggaaga 
tacgtgcagtagcacaacattaaatcatatttccaccatatacaca 
caataggtgtaaataacacccagagcatagaaaa 


SEQID 
NO. 138 


rsl42275 
2 


GGTTGGCAGCTTTTAATAACTTAGAAATGGCTGGGGGTGGGGGGGA 
GGAAGTACTGAATCATTTACTCATTCAGCAAATAACCAGGGAATAC 
CTACTC TACACTGGTCACTGATGGAGATA [C/T] AGACTTGGGCAA 
AAGCCGCGTCATCTGGTTGTGTTCAAGCTGAACATTCCCTTGACCC 
AGTCACTGATGGAGATATAGACAGGCAAAAGCCACGTCATCTGGCT 
GTGTTCAAGCTGAACAGTCCCTTGACCCAGGGCCCATGACAGGGCA 
GAGGGCAtattattatccccattttacaaaggaaagagctgtcaga 
cacagTGTCACACAGGAAGGTAGACGATAATGTCAATATCCCTCAT 
CTTAGTATAAAGTTGTCCTTAAAAACTCTCCATTATTTATTAATTT 
ATTGACTCACTTATTCATGTTTTCTGCACAGTGATACTTATCCTGC 
ACGAGACTCTCACACCAGTGCTTTGGGTGTAAGAACACCCCAAGGA 
TTGTGTTCCCTTTTCTCGAAGAGTCTGTGGTCTAAGGGGATTCAAT 
GGGGTCCACTTTCCAAACCAAGACAGCAAAGGAACACTAGGAGAGA 
AGTATTCTGTGCAGAGATTCAGTTAT 


SEQ ID 
NO. 139 


rsl42275 
4 


GGATTAacaggcatgcaccaccgcacctggctaatttttgtatttt 
tagtagagatggtgtttcaccatgttggccag [A/G] atggtcatg 
atctcttgaccttgggttctacccacctcagcctcccaaagtgctg 
agattacaggtgtgagccgctgcacccggGCAACTGGTTTCCTTTT 
ACTGCCACTTTCACTAACCGTGGTATTTCTCCATGGGCAGCATTCT 
TGGCATTTGGGTGTGTAGGACTGTCCCTCACATAGTGACCTCTTAC 
TCATGAATTGCCAGTGTCACATTCAGATTCTTATGGCAACCAGAAG 
CTCCCCTGCTCCCAGCATTTCTGGACTCAGCCTGGGCTGGGGAGGT 
TAGCTCAGACCAAATATCTCCTTTCTGCCAGTTGCTCTGCTAGGCC 
CAGGTCATGCTGAGCAGAGCAAGATGTAGCTGAAAACCAAATAAGT 
CACGTGTTCCAGCTTGCTGGGGTTTTGTGAAGAAAGCAGCCACCCC 
TCCAGTCATATAGTTTGCAGGTTGGGATTTGCATT 


SEQ ID 
NO. 140 


rs205560 
6 


tggctattgtcttaagctactattaccttcttgcttgtcaagttgc 
gcatttacttttcaaggcttgctacgtgcctggaatttctagattt 
tcctttatttccatgcttggggagaggagtgcctggcaggctccta 
agagggg t ctgtgctccatct cGCCCCCTATCTTGAACTATCGGTT 
GGGTGCTCTAGAATCTGTATGGGGTGGAAGTGTTCATTCATTTTCT 
GTACAAAAGCAATCAATGCTTATTGTGGAAAACCCAAATAAGAGAG 
TTGCTCTAAACAACACCCTCCCCAGTCCCAATACCTTGTCCAGAAG 
AAACCACTGTTTGGTGAGTATATTAGT [C/T] AATGTCTGCAGACC 
AGATCGGATGACCAAGTTTTCCATAAATGGATGGCCATCCACTTCC 
CTTCAAGGGCGAGGGTAGTTTGTTCTGATCCATCTCCCTGTTTCAC 
AGCTCAGGGAGGGAGGAAGACCCAGGAAGGAGAGCTGCCACAGTTA 
CTAGTGGCCCAGCTGGGATTTAAAGTCCGCCGTGACTGAAGCTTGG 
CTCCACATGCCAGTCTGCAAGGCCCTGAGTGCCCTCAGCAGTAATT 
CCAAGCAAAGCAGGGAAGCAGCGGGCCAGGTGCTGAACTGAACTGC 


SEQ ID 
NO. 141 



-140- 





TGCTCAGGGCTCCTG 




rs933656 


CTGCATATGTTCCCCCAGGTATTTGCCCCCGAAGCACAGTCATCTC 
ACTGCCTTGCATAGTGGAATGCTAATCAGCAGAAGACCCTTCTATG 
GGAGGCAGCTTGGAAACCTGGAGGAAGCCCTGGCTGAGGAGGCTAG 
TGGTCAGGGAGCCTATCCTGGCCAGGTCACTTTTCCCCACTGGGGC 
CTCGGTTTCTTCTTTGTAAAGGGAGAAACTTACATTAGGCATTTCC 
TCAGGTTCCATTTGGTTCTCAAATTCTAATATTTTTATGGTTGATG 
CTCTCACCAGAGCTGCTGCTATGATCTCAGAGACGTGAGGCTCAGA 
TCTAATTAGAAGCAACCGGAAGAGAGCAGTTGGGATTTTTCAa C t C 
aggaatcagtctccctgctgggttcaaattcaggctctgccactta 
cagctgtat ga cTAAGCCTTGTTTTCCTCAACTATAAAACAG [ A/ G 
] GATAGTAGTAGTTACCATCTTAAAATAGCTGTTGTGTTGTGTGGA 
TTTCAAGGATCATGCAAGTCAAGCATTTAGCACAGTCTCTGCTACA 
TAAGTGGTCAGCAAATTTGAGGTACTATTC 


SEQID 
NO. 142 j 


rs233909 
1 


AGACCCTTCTATGGGAGGCAGCTTGGAAACCTGGAGGAAG 

CCCTGGCTGAGGAGGCTAGTGGTCAGGGAGCCTATCCTGGC 

CAGGTCACTTTTCCCCACTGGGGCCTCGGTTTCTTCTTTGTA 

AAGGGAGAAACTTACATTAGGCATTTCCTCAGGTTCCATTT 

GGTTCTCAAATTCTAATATTTTTATGGTTGATGCTCTCACCA 

GAGCTGCTGCTATGATCTCAGAGACGTGAGGCTCAGATCTA 

ATTAGAAGCAACCGGAAGAGAGCAGTTGGGATTTTTCAactc 

aggaatcagtctccctgctgggttcaaattcaggctctgccacttactagctgtatgacTAAGC 

CTTGTTTTCCTCAACTATAAAACAGAGATAGTAGTAGTTAC 

CATCTTAAAATAGCTGTTGTGTTGTGTGGATTTCAAGGATC 

ATGCAAGTCAAGCATTTAGCACAGTCTCTGCTACATAAGTG 

GTCAGCAAATTT[G/T]AGGTACTATTCAATTTATGGCTCTAT 

TGTTTGGGGCTTCCAAATGTCCAGAGTAAGGCCATTTTCGA 

AGTAGGCAGTACATCTGAGAGCCTTAACAGCTCATTTCTGG 

AAACCTTATCCAGCCCTATCCAGATAACTAGGACCAAAAAC 

CCCAGCACACAGATGCTCGTCCCTTGCTTCAACCCTCACTG 

ACCTCTACTCTGTGGCTTGTCCTGAAAACATCAAAGCCTGC 

TCAATTAAAATCCTGAATGCCTTGATAATACAATTTAGAAA 

CATACATAGTTrj'1'AAATAGGGCAAAAACTCTGCATGATTA 

GTGCTGCAAGAAGATATCCAGCCCAACCTGGGTGTTCAGGG 

AGCGCTCTCTAAAGGCAACAGAAATCTAAAGTAATTTAAG 

AGCCATGCCACTGAATAAAAATATTCAGGTTCATTTCCTGT 

CCTTCTCTCTGTTTGGGATCTTTGTGTGTCTTTAATTAAAAG 

TAGGAGAGCCCTGCTTTT 


SEQ ID 
NO. 143 


rsl86233 
1 


ACTACTTCTAAAGCCTCTTAGACCCTGGTAATCTTCCTCCTAACAC 
CATCGGGTGACTGCAAAGCACTGCAGGCCAGACTTCAGTTCTGCTG 
TGTAATTTGCAAGCTGGGTGACCTTCCTTATCTATAG/^ATGGGCTC 
T [C/T] CTGCATGGCTGGCATGAGGAATAAACAAAATGGTTGTGTC 
CAGTGCCTGGGGCATAGCACAGCTCAAAAAACTTAGTTCATCCTCC 
TGAGGGATCAAGAAGATACTTGGAAACAAATGTCCAAGGGCGTAAT 
CTTGAAGGGGCTTGTGCCAGGCATATATGGAGAGAAGGGTTTTGTG 
GGATGTCAGACTTAATAGTGCCCTTTACTCCCCACCCCCGTCTCTC 
TGTTCATAGACAGGAAATCTGTGGCCTATTCTGGGACCTCAAAGTG 
CCACAGGGTTAAAGATACCAAGTCAGAAATCTAAGGTTCTAAATGG 
ACTTTAGACCATTTTTCATTTGGGAAGGAAGAATTCTTTAAGGGGT 
TGTGCTGGCGCTGTCTCTGTATGCATGTGCAGAATGTGCTTCCAGA 
TGGGGTAATGGTCTGAGTTTGAGGACAGAAGTCCACTCCACTGCAT 
TC 


SEQID 
NO. 144 


rs233913 
9 


GGGTGTGGCCTTTGGACAGCACCTTAGCAGGAATGTGGTGG 
AGAGCAGCCCCATTCACTCCAGAGGAGAGCCTCAAACTCTT 
CAGGCAGATCTAGCCTAGGTAGAATCTTGGCCTGGCCCCTC 


SEQ ID 
NO. 145 



-141- 





CGGGATGACAGGTGCCATTGCCCAAGAATGGGGAAAAGGC 

TGAAGTGCTCCAGCCAAAGACCCCAATTTATCTTCAGGACA 

ATTTTCACTGGAAACCTTGCCTCACCACTGCCCACTTTTTCA 

GAAGTAATTAGAATGCTAATCTATAAGAAAGATGACtattaaaa 

ataaattaataataGAT A ATA C ATTTTGGCTTA C AATTTTG AATAAT 

ATAGCCATCCCATCTTAAAGTAAAAATTCATATATTTTTAAT 

AAGCCTGAGACATGTTTTCCAATGAACCACAGATGGTTCAT 

TTTTATTATCCTATAAAGAGACATTATGGGCAAGTGTTTTTT 

AAAATGGTAAAACAGAACCTTAGAGCAGCTCTCTTTTG[A/G 

]AGATCTCTAAGCACTTTCTAAGCATCAGGACCCCCTTCTGT 

CATCACAGAGACTGAAATGAGGAGATGGTCTCTGTCACCCC 

CTCACTCACCAGTGAGCCCCAGACCTTCATCCCTGATCAGA 

TGGAAGCAGTGTGGCATGATTACAGTTCATATTTCAACTCT 

GCCACTCAATGACTAATAGCCAAGCACTAATAATGCAGAA 

AATGTAAATTTAAAAAATAATCTTCCTGAGATTGGTTATGA 

AATGCACTCAACACAGCACCATCCACAGAGAGGTTCTTTTT 

AATTGCTCTTTTCTTTCCTCTCGACACCCAGAATCACAAAGC 

ATGCCTGAAAGCGTCACACATATATGTCTGTGACCATAACA 

TGGCATTGCACATGCAAAGGAAATAAATAGGTGTTACCCAT 

GTGACAAAGGTCCATGAGCTCTGTCCGCAAAAAGCTGTTGA 

GTTTAAAGAACAAATAATTCTGAAAAATCTTCCAG 




rs872435 


CTGCCATTCTGATCACTGCAAGACCCCCACCCCCAATACTCCCAAT 
TGTACCACCCCACCCCACTCACCAGTGTCTCAGAAATGCCTCCTCC 
AGAAGGAAGGCATCCTGTCTAACCCACTGCTTCTAGCCAAGCTGTC 
TTTCTTCAGAAGGTAGAAAAA [G/T] ATTGTTAGTCATTGTTTAAT 
CTTTATTGAGTATATACCGCCACACCAATTGCACTGCCATTCATTA 
TCTCATTTAAATCTGACAAGAGCCTTGTAAAGTAGGGATTATTCCC 
ACCATTTCCCAGATGTTGAAACTGAAATTGATAAACACGACATGTT 
GCCATGGCTACATGAAGATCTCCAAGCCGGAGGATCTCCACCCTCA 
CCTGCCTAGCTTCCCAGACCTCTCTGCAGAAAAGGGACTGACCCCC 
AAGACAGCCCTGGCCTCTGGGCTCCACCCCTTCCACATCCATCCCA 
GGGCCGCTGAGGACTGAAGAGTTCTCCACGTTTGCCCTTTAAAGTG 
ACTTAAAAATAATCTTTATGAATTTCTTCATATACAAAATTTGTAC 
TTACTCATTGCAGCAAATTTAGAAAATACACATAAGCAAAAAAGAA 
CGTAACAGCCATCCATAACCCTAACTCTCAGAGATCACCACTATTA 
AAATGTTTATTATCTAAGAGAGAGATGATATAGACAAAGATGAGAC 
AGATTGACACAGACAAGATGGGTACATGATAGATATTTTCTGTGTT 
ATAACCCTTGCTTTTTCTTGCACTTTCTAGAATTTTTCTGAGAACT 
AATCTGAAATCTGCACAGGGTCCCCACGTTTGGATCCTCTATCCCA 
TTGCCTTCCA 


SEQ ID 
NO. 146 


rs329468 


AGCTGAGCCCCAGGGCTCCCCCATGAGTGGGGAGGAAACT 

CATGAGTGCCTTCTATATGCCAGCGCTCTATCTGCAGGGGT 

TCTTTTGATAGCAGCAGACTGAGAGATGATGTTACTGTCCC 

CTTTTTCCTGTTGTTGGCAACTGAGACTCAGAGGATGGAAG 

TGACTTGCTCAGGTCCACCACCTCTTCAGCTGTGGAGCTGC 

GACAGGAGCCTTTGTTTGACTTCAAAGCTCACCATCACTCC 

TCTCTCACTGATGCTCAAGTGGGCTATCACCTCGCCTTTCCT 

GAGCCTTCCTTCGCTATCCTAAAACAGCGCCTCCCGaaatcacca 

ctaaagaacttattcatgtaaccaaacaccagcggttcccctaaaaacctatggaaataaaAATT 

AAAAATAAAAACAGTgcctcccatgacccatgtctctccagtcccataactctgctct 

atttccattcacagctccatccccacctttatgtcttttgttcactgctttatccccagtgcctagaagagt 

gcttggcacctagtagacactcagtaagtatttgtcgaatgagttaatAAGGTTGTGAAA 

AGAACGTTAGATTACTGGAAGGATTCATCTGAGTTTAATTC 

TGCTATGCTGGGAATCCAGTGTGCGGCCTTGGATGA[A/G]G 

CCAGTTCCCTCCCTGGGCCCCAGTAGCCACATCTGTACATTT 

AGAGGGCAGGAGAAAAGCCACACGCTCTGTGACTTATACA 


SEQ ID 
NO. 147 



-142- 





ACTTGTTGCCCAGAGTGGAGGCTGCTTTGATGCTCAGAAAA 
AAGAAACAAACATGGAAATGCTAAATGGGTGGCAGAGAGC 
TTGAGGGAGGAAGGAGATGGGGAGGGTACTCTTGAAACTG 
TTTGGTGTCTTCCCTCCTGCCCCCTCAGTACCAA 




rs50364 


GCCTGACAGATTTTTACTGAAGGGTGCACATTGGAATAAAAAATGT 
GTTACCTATCTGGTTGAGTCTTCAGCTTCAGAAAGGTAATAGAGCA 
AAGGCAGATAAATCCAAACAGGGACTGAGCTGTTTTCATGCAGGCT 
GCCTTGGTAGCTCTCCAAAGCCTTCAAAAATGATGAGATTTTTTTT 
TAAATCCTTTTTATCC [A/G] GTTGTTCTCAAGGGATTCCACCCCT 
GCATAGGAGAGCTCACCATTCCTGGGATCTTCAGCTTCTATGCCTT 
TGCATATGCTCTTCCCTTGTTCCctcattcttcaacactcaactga 
attatcacctcccttgaagccttctctgacatcccTTCTAGTCCCA 
TGCCACCCAGGAGGCACTAAGAGCTTCCTCCCCTCAGCTCCCAGTT 
CTTAAACATGTCAACACTGTTTTGAAATGATTTGCCAATGAAAAAT 
TCTAGACCAGCAACCAACAAcatccttcccaaaggtgtgttatata 
tggtacatgctctatgtgctaaacaccaaattcattgataacagct 
aagaaccaggaaacaaaccatcgttaattatggcatctcttgaaaa 
atctaaagatctggactcactgggcttaaatgactgcatgataaca 
actggttgagtaacaactgtttccctttcatggagcagttactctc 
cagttctcagttcctaccactctctatagttgtacactcatcatct 
gtcctcatctgaattacctgccaatgactactggcatttgagtttc 
taatccatgGTCTATGTGTATGCCTCCTCACCAGTGTGAGAACTCA 
TGTAAACAGGTATTATGTCTTTTCATCTCTCTCCTAA 


SEQ ID 
NO. 148 


rsl55158 
3 


ATAATGGTCACGTTGGAGCAATTGCCATTTCAAATCATTAG 

GAACACTCAGGTCACTTTGGCATGGAGCTATTTTGTAAAAG 

ACGTAGAAGCCATTTATAAACTTTGGTTTGCTTTTTAAAAAT 

TTATTTCATTCTGAGGCTTATCCGTGTAAAATTACCAAAATG 

ATTGTGGTTAGACTCTACATTGTCACAGTATTTAAATGTGC 

ACAATATTCCACTTAGAAATAATGTCAGTACTAAAAGTAGT 

AGAGGGCTTTGATAGCAATATTAATACATCGTTAAGCCCTT 

CTCATTAAACAGTGTAATAGTCTTGTTGAAGTTTGTTAGGC 

ATTTTAACCACTACTAATTAAAAATAGACCTACTGACTAGT 

CTGTTTTACTGTGCTTTATTGTGTCTTGGATGTTCATTCAGA 

TACTTTTGCTGTTGAGAAATCAAATCGTCTCTTATGGTTTTA 

ATTACAAAATACATATTAGAGGGATACAGTTCTTAGGGCTG 

TGATTTTTAATTTGTGTAACCTTTTTTTATTTTGGAAAGGAA 

ATTTCAGATTTTTTCTAGTAATTTTTCATTTGTGAGTGTTGTT 

TTCTAGATACAGAAAATGTACCTAGATAGATGATCACATTT 

TAGGATATTTTGCTTACGTGTTATTTTATATTTATATACTAT 

AATACCATTGTATAGTTCAGAACAAGAAAATATCTTGATAA 

ATCATCTGCTACTGTGAGGCAGTTAAAAAAATTTGAGGCTC 

ACTGAAAATGTGTGACTTGCCCACTGTCTCATATTGCTAGT 

ATTGGAGAGAAAACTAGAATCTAGGCCTTTATTTTCCTGAT 

GTAATGATTTTAGCTAATTATTATTTATTTTCTTAAATCATT 

GCATTAATT[C/G]ATTTTTCACAAGTAGAGCCTATATCAGTG 

TTTGCaataattaaattttaagtatatttctataattgtaaataaaatCCTGACATTTGTT 

ACAGGATGGGGTTTTCTTTCATCatatttttataataaaaattaaGCAGTT 

ATAAAAATAAATAGCCTAGTTTTTCAATTGGTATAAGCTGG 

CTTTATTTTATACTGCTAATAAAGGCACATTATGTTCAAGCA 


SEQ ID 
NO. 149 


rsl45769 
2 


CttatatattcattaattaataatttatattCACACAATGATTGTA i 

GAAATGTGAGTGTTTCTTAGATTACCAAACATCTGTGAAATCGTGA 

AGGAGTATTGAAATTTAGTAATTTGGTTTGGATCTTTGAAGATATT 

CTGTAGAATTGTTTTCCAAAAGTTACAACTGGTTTACAATTTTTTT 

CTTAATTGCCATTAACAAGTTTTGACCCTGAGATGAGAAATTATTC 

ACAAATTTCAATTAAATACTGGAATGCTTCATATTTTCTGTACTTT 

AGGAcagggatccccaacccccaggccacaggttggtactggtttg 


SEQ ID 
NO. 150 



-143- 





tgacctgttaggacctggactacatggcaggaggtgagcggtgcgt 
gagaa [A/G] cattactgcctgagctccacctcctgtcagcgacag 
cattagattctcataggaggacggaccttattgggaacacacacaa 
gagatctaggttgcggactcctcatgagactctaatgcctatgatc 
tgaggtgggacagttttatcctgaagctcccccactatccgtccag 
ngaaaaatttggtcccttgtgccaaaaacactggggacctctgCTT 




KCP_1035 
5 


AGGGCTGGGCGTCCCCCGCCCCCACCGTGCAGCCCTCGCCCCCGCC 

CCGCCCCTCCGTAGTTGCCCGCCCGCCGCCCCCTCCGCCGCCCCCT 

CCGCCGCTCCGACTCTCGCCCCGAGCGCTGGCAGCAGGCAGCAGGC 

AGCAGGCGGGCGCGCTGTGGCTCCGCGCCGCGCGGTCCGGGCTCTG 

TTCATTCATGATTGGTACTCGGCCCTCCGAGACCCAGCCCGAGCGC 

AGGGAGGGGAGCCGAGTGTGCGGCAGGAGGGGCGGGCGGACGGCGG 

CTCCCGCACCGCACGCGGCGCTGGCTCGGCAGCCTCGGCCGGGCGG 

CCGCTCTGGCCCCGTGTCCAGTGCCAGGCAGGCTTCAGGGCACCGT 

CCTCGGCCCTGGGCGAGGGAACCGCCGGGCCGGGTCCTCGCGCGGG 

GAAGCGGTTCCGAAGGCTCGCGGGGAGCGGCTAGCCCTGAGTCCCT 

GCATGTGCGGGGCTGAAGAAGGAAGCCAGAAGCCTCCTAGCCTCGC 

CTCCACGCTTGCTGAATACCAAGCTGCAGGCGAGCTGCCGGGCGCT 

TTTCTCTCCTCCAATTCAGAGTAGACAAACCACGGGGATTTCTTTC 

CAGGGTAGGGGAGGGGCCGGGCCCGGGGTCCCAACTCGCACTCAAG 

TCTTCGCTGCCATGGGGGCCGTCATGGGCACCTTCTCATCTCTGCA 

AACCAAACAAAGGCGACCCTCGAAAGGTAAGCCACCTTCTTCCTTT 

TGTTCCCCTGTCTGGGCTTGGGGGTGCTAGGCGCCGAGGTGGGCTG 

TGCCACCTGCCTCCCTTAGTCCGGACTCTCCTCTCCACGAGGAGCC 

CGGACAGGTGCTTGTATCCAAAGGAGAGAGAAATCGGCGGGAGGGC 

TGGTGTGAACACCCAGAGGAGGGAGCCGGAGTGGACGTCTGCCCCA 

GCGGCAACTGGACCCCTCTGGGGCACCAGGTGTCGGGACTCTCCTC 

CTGGGGAAATCTCTGAGAGCCGAAGGAAGCGGCA [A/T] GTTCACA 

GGTGGGGGTGACCGGATTCTCTGGTGGAAGTGTGGTGAAGCTCTTC 

CCATTCCCATGACAGCTGGCGTTTGAGCACTCAGTGAGGGTGCTGC 

CACACTCCCACACTCCTCCTAGGCGGCTATGCCAGGTGCAGACCTG 

CGAGTCCCTTCATCAGGAAGAGTGCTCTGTCTGCACCCCCAAAACC 

TCTGCAAGCCAAAAGGAATCAGCTGCTGCCAGGGGTAAAACTCCCA 

GGCCTCATGTCCTGGTGGCTCCGGGAGTCAGGAGGAGCAACCGTGA 

AGGGCTGGC TGCGAGCTGAGC TTACATCAAGGATTAAAAAGCATAA 

TATCGTGGAGTCTCTTCTGCCTGGACGCTGTTCCTTCACCACCTGT 

CCCCAGCCGAGGCATGGCTGATCTCACCATCCGTGGGAGAGTCCTC 

AAATGGGTCCAGGTGAAGTTGGAACCAGTGTGTTGGGCCCTGGAGG 

ACAATGCAGGTCTCCTTACCAGCAGTTCAAAAGTTAGTGGTTGGAA 

TAAAGAGACTGGAAGCAGTTAGGAAACGGGAAATGATGGGTTTTGT 

TTTGTTTAATGTTCAAATGTCACTACGAGTGGTAAGATTTTAAGCA 

GCTTGACACTTAAACATTCAAATTCTACCATCAGAGCCCCCATCCT 

GGATACAGGTGGGAGTTAAGCTCCTACCCTACAGGCCTGATAGTGA 

GTAGAAGTGTAATGGGGTAAGGGACCCCAAGTGAACAATAAGTCTC 

CTCTTAGAACTTGGTTGGTCTCACCCTGTTTAGAACCACAGAGATC 

TCCATAAGTAAGCTGTCCTTGAAACCCCCTGGAAGAAGGGGTCCCA 

GCTTCTGGCCCAGCTCCCAGGGGCATCAGGCTGGCTGAGCCCCGAG 

GAAAGAGATCTCTGGGTGCAGATCTTAGGTGCTGAAGCTGGGTTGG 

CATTTACATCCTAGAACATAGGAAGAGGCTTTGGCCCATTTGTCCA 

GCTGAGTTACATGTCCTGCTGGCAAGG 


SEQ ID 
NO. 151 


KCP 1044 
6 


TGGGGGTGCTAGGCGCCGAGGTGGGCTGTGCCACCTGCCTCCCTTA 
GTCCGGACTCTCCTCTCCACGAGGAGCCCGGACAGGTGCTTGTATC 
CAAAGGAGAGAGAAATCGGCGGGAGGGCTGGTGTGAACACCCAGAG 
GAGGGAGCCGGAGTGGACGTCTGCCCCAGCGGCAACTGGACCCCTC 
TGGGGCACCAGGTGTCGGGACTCTCCTCCTGGGGAAATCTCTGAGA 
GCCGAAGGAAGCGGCATGTTCACAGGTGGGGGTGACCGGATTCTCT 
GGTGGAAGTGTGGTGAAGCTCTTCCCATTCCCATGACAGCTGGCGT 


SEQ ID 
NO. 152 



2345.2049-005 



-144- 





TTGAGCACTCAGTGA [C/G] GGTGCTGCCACACTCCCACACTCCTC 
CTAGGCGGCTATGCCAGGTGCAGACCTGCGAGTCCCTTCATCAGGA 
AGAGTGCTCTGTCTGCACCCCCAAAACCTCTGCAAGCCAAAAGGAA 
TCAGCTGCTGCCAGGGGTAAAACTCCCAGGCCTCATGTCCTGGTGG 
CTCCGGGAGTCAGGAGGAGCAACCGTGAAGGGCTGGCTGCGAGGTG 
AGCTTACATCAAGGATTAAAAAGCATAATATCGTGGAGTCTCTTCT 
GCCTGGACGCTGTTCCTTCACCACCTGTCCCCAGCCGAGGCATGGC 
TGATCTCACCATCCGTGGGAGAGTCCTCAAATGGGTCCAGGTGAAG 
TTGGAACCAGTGT 




KCP 3858 
9 


TCAAACTTTTCATTTGCTCAAAGCCTACAGCAAACTCAGTCCACAC 

ACTTGGCTATACAAGAAAGGTTGCTTTCTTTGTTGTTCTATAACTG 

ACTTTAATTTCAACTTCAAGTCCCCATTCTTGCCAAGGGGTAGAAA 

TGGAATCTTGGTCAACTTAGGTTCCCCTCCCTACTCTCTGGGGTTG 

CATTTCCAGGCCAGGCAGTTTCTGCTGGTGCTTTTGTTCCTTGGTC 

CTCAGTCTTCTTTCTGTGTTGACATCCATTGACATGTCCTCGACTC 

CCCTCATCTCAGATCACAGGCCCATGCTGACTCCAGGAGTATTCTT 

GTATTCTCTTCATCTGAACCTCAACACTTTTTGAGACCACGCATGC 

ATGTGCTCTCTCTTTCTCTCTCTCTCTAACACTTCTGGAACACTCT 

TGGACATGAGGAGATATTGGTCTTTCTAGGATGGGGTCAACTGGCC 

CTGCCTCAGATCCATTGGCCTGTACATATCTTGTAGCCATTGTGGT 

GCCATGGATCACAGGTCACGATGCTGTGTGGCTGCCTCTGCTCTTA 

GACCTGCCCCCCATGCCACCAGAGGGAGTGTCTGCCTCCCCCTGCC 

CTGGACACTCAGCTGGAGGGGAGGGTCACAGTCCCTCACAGTCCCT 

TCTCCAGTGACAAGCAACAAACTCCCAGTCTTCCTTTCTTTCTGAT 

CCTCTCCTCCTCTTCCTCCTTCTCCTCTTCCTCCTCTCCCAGTCCA 

AGGAAGTTTTATGCAAAGGCCAGAGGAGGGAATAATGAGGTGGAGG 

TCTCTCTGACCAAGCATGTAGCCTTCCGGATCTGTTGTGCTTTCCA 

GGAGTCCTTCAAAGCTCTAAGCTTTTGGAATTCTGCAAGCTCAGGA 

AATTGAAAACCTTTTCTCTCACAACTGCAGGTCTTTGTCTGCAGTT 

GTAAAAGTCTGTTTAGAAACTCAGGAGACAAGCAGCATCTTCTTTG 

TTCCCTGCTTTCTGGAGGCAGTCAGCGTGGAACA [A/C] CCTGCCT 

GCAGTCTGACTCAGGGAAAGGGTCACTGAGTGTGTGTGTGTGTGTT 

GAGGGGTGGATAATAAGCAAGGAGAACACTCAGACAGAGAGCTCAC 

AGAGGGGCACCCCAGCACCTCCCTCACCTCTATATTCCCCGCCTGG 

GCATAGTGGAGGGAGGGTTAATGCCAGCCAAGTTTAACAGGCATTT 

CTGATTCGCGGCATTGTTGTTGCGCTATCCTGCAATCCTACGCTGC 

GGGTACTGTTTTTATCCTGATCCTTCAGCTCTGGAAACTAATATAG 

AGAGCTGAGTAACTTGCTTGAGGCCATGATGCCAGGATCCACGGTG 

CCCCCAGGCTGAAGAGCCTTAACCACTGGGCTGTACCACCTCACAG s 

GAGGGCAGGTGGCACAGTGCCTGGAACTTGGGAGGGTCCAGCACGT 

GGAACTATGCTCTGTCATTTACTTACTGTGTGTCACTGGATCAGTC 

ACTCAACACCGCTAAGCCTCATTTTCCACCTCTTCAAAAGGGATCT 

AATAAACCTGTTAGCAGAAGGCTGCTGTGAACACTAAATGAGGTGG 

CTTAGGTGAGAGCTCTGGTCTGAAGATGCTCACACTTTGAATCTCA 

AGACTTGTGTGAACCAATATCAGATTTCTCCTATTAGATTGCAATT 

CTCAGGGAGTCACATTCCGTCTCCAAATGCCCATCTCCTGATCCAC 

AAAATGAGCACAACATCTCTGATAAACGGTAACTAGATGGTTCCAG 

TGGGCAGCGGGAGTGGGAGGGCGGTTGACTGGGCCAGAACCTCAAA 

TGTATTCCTGTGTAGTTTCTCATGCATTCATTCAGTTTGGCACCAG 

AAGGTGCCCAGACTCACTTTGCAGCCAGTCTGTCCCCATAGAGGTG 

ATAAAGGAAAAACATATGCACATTTAAACTTTTAAAAGTTTATTTG 

AACATTCAGCGATTCACAAACGGTATAGCACAGACAGCAAGCAACT 

AGCACTCCTCTAGGAGGGGCCAAACAG 


SEQ ID 
NO. 153 


KCP 6519 
9 


ACAGAAATCCTTAAGAGCATCAGCCGTGACACAGAAATCTAATACA 
ATAAAACAAAGTGCTTATAAACCCCAGAGTTGTTTAAAACCCAGAA 
ATTGCCAATTGACATATGGGACTATATCTTCTTAGCCCCTAGTAAA 
CTGAGTGGCTTCAAACAAGTCCCTATCACCTCCCAGGGCCTCAGTT 


SEQ ID 
NO. 154 



-145- 





TCTTCACCTGTGAAATAAGAGGATCAAAAAAAGATAATGTTCTCTC 
TGTTCTCTTCCAACCGAGGCAGGCATCTCAAGTATTTCTTAGTCAG 
TTCTACTCTAGGCTACACAGTATCTGTATCTGGCAGCTGTATGAAC 
TACTGTTGAAAATCCTCTTCCCAATCCCAGTTTCAACATCACTCCT 
CAAGGCAGCATCCACCTTCACTCTAGACTGAATTAATTCCTCTGTC 
TTACCACCTAAACTCCTCTAGAAAACTTGATAGAGGTAAAGATAAA 
TGCATTTTTTCAAAAATTCTACTTTTCTAGTCCCAAGGCATTGTGT 
ATATCATTCTTATGTAAGTTATCACAATAAACCCATAATTAGTTAC 
TTCCATTTATGTCAAATCGCCTACAAAGCAGAAACATGTATTATTC 
ATTTTTGGCTTCCTCCCCAGTATCTAGCATACGAACTGTTTGCAAA 
CATGCCCAGTTCTTCAAACTTTGTAACTTCATGCCTTTTCTATCTA 
CTACTTGGGATGGGCCCACCCTCCCTTTGTCCTCTAAGCACACTCC 
TATTCATCCTTCAAAGTCCAGCACAAAAATCCCCTCCTCTGTTAAA 
CTTCAACTGCTCCAGGCTGAGTCTTATGTTTGGGTCCTTCATACGT 
ACCCCTCTTCTATTGTTTGGGGTATTGTGTGCTGTGGGATCTGTTT 
ACTCTCAGTTCTCCCCTCTAGGCTGGGTTCCTTGAAAAACACCCTC 
TGGACATTTCACCTCTACATCCTCTGCATTCTTGGCCAGGCTCTGA 
GAGGGC ATTGGTAAATGTT AACTGC CTGGCAATG [A/G] TGATGCT 
GTTAACCTGATGTGTCAGGGGTCTGAATAAAGCTGCCTCAAGGTAG 
GCAGATGCCCACAACCAAGCAAGAACTCAAAGCTGCAGGCTCCTCA 
GCCTGAACCTTAGACAGCGTCTTGGTCACCATTTCAACACCTTGAC 
CACATTTCTCACTCTCCCAAATTTCCTCCTGCTTATTCCTCATCCA 
CATACATAAGGCTGTGTCTCCCAGGGGAAATTCAACTACTTGGTAA 
TTATCCTGCTTCTTAAGTTTGGGGCTAGGGGATTCATAGATGATGT 
TCAGTATTATGCTGTGCAATGTAGATGCTTCCTAAACCTTCTCAGG 
AGCTACCACTGAGTGGCACCTGGGGACCTCTCAGGAAGAGCCAGTT 
TTCTGGGCAGTGTGGGGCAGGACAGAGCTCATTAAACCAGCCTACC 
ACCTGTCTTCCAGCTCCTCCTCTCAGCCTCTGGGCTTCCAGCAGAA 
AGCACACGAGAGCATTCTTGTTGGTTTTCTTATGACTTGAGCCAGC 
GAGACGTACATGCCCAGCACCTGTTACCTGGGCTGGCTCTTGGCTG 
AGAGCATACATGCATTGGGTCAGGTTTCAGATCTGCTGGAGGAACA 
CAGCCAGAATGTCTTGACAGGCAGCCCTGGCAAAGCCCCAGAAAAT 
ATAAGATCTGAGTCTTATGATGGACTCTGTGACCTTGAGCCTCTCA 
CCTCGTGACCTTGGGCATCTCATGTTCTCTCCACAGGTCTCGGTTC 
TGGACTCCTTCATGGGAGCTGTCATGCCCCTGTCACACAGCAGTGT 
TGTGCCCCCGGGGATCAGGGACCAGGATGGTCCTTTCTTGGTGGTG 
AAGGGGGCATTTTGCATATTCCAGAGATTCAAGTTTCCAGACCTAT 
CTAGAAAGAAACATTTGAGTTTACAGGTTGGCGCTTCTCAGCCTCT 
GTCTCTCTTCCTCTCTGTTCATCTCCCTCTGTCCCCTCTATGTATG 
TTTGTGTCTCTTTCTGTCTCCTCTGCC 




KCP 8246 
8 


CTCACTGCCTGCAGTTTATTCAGGCATTGGATGAGACAGCTTCTTC 
CTGCTCCATGTGGAGTCAGCTGGGTACTTGAACTGGGACATGGATG 
ATCTACTTTCAAGATGGCTTATTCTCAGGGCTGCCAAATGGATACC 
GGCTATCAGTTGAAAGCTATAAGCAGGGGCACTCTGCATAAGCATG 
GCTCATCTCTACAAAAGCTCCTCCCCAGTCTCCTTGTTTGGGCCTC 
ACAGTGTATGGTAACCTCAGGGCAGTCAGAATGTGACAACTAAAGA 
CTTCAGGAGTAAGTATTCCAGGAAGCAAGATATAAGCTATGTGGCC 
TTCTAAGACCTAGCCTCAGAGGTCACATAGTGTAACCTCTATCACA 
CCCTATTGGTAGATATTGTAACAGAAGCCCACCCAGTTTCACAGAT 
GGGGACATAGACTCCATTTCTTAATAGGTAACTGGCCAGAGTTGTA 
AAAGAGCATGTGGGATGGAAGATATTGTTGCAAGCATCTTTAGCAA 
ATACAACTGGACATACCCAATGCAAGCACAGGATTGATCCTCCACT 
CTGCCCCCATACCCCATGATTTATTAGCCACTCGGACAAGTGACTT 
CAACTCTCCAAGCCTCTGTCTCCTCCACTAAAGTGGGGACAAATGA 
GTATTACAAATGAGACCATTAAATAAGATAATACATTTTAAAAATT 
AACCTGGTACCTGTCACAAAGTACATGCCTAACAAATGTTTGCTTC 
TGTCTCACTTCCTCAATTTCATCTCAGTCAACCTGGACTGACTCAA 


SEQ ID 
NO. 155 



-146- 





AATGGCATTCTTCTTGGCTGCCCCCTTTGAAGTATTTCTGCTGAGA 
AAATAGTTTCTGTGTATTTGTAAATTTACAGGTTGAACATAGATCA 
TTATTCAAGCATTGCTGGTCGATTCGTCTTTTCAAAGGCGGGAGCT 
GCTGGCTGTGGGAAGGGACCCAGCAGGGGTCTCTTGCAACCCTGCT 
CTATGGGTGGGGGAAATCTGGACCTCCCTCTGGT [A/G] GGGTTGA 
TTGAAGTGAAGGGTCACCATATGTCTTTCCCAAGAGGGTGACTGAC 
TTCCTGCTTTGGTCCCAGTTTCCCTGAGATTTTCCTGAAAGCCCTT 
CCGGCTAGCCCAGTTGGGAGTGTTAGTACATCAGATCCCATGCTTT 
GGTGAAAAATGTAAACACAGACCTGATTTTTCATTTTAAATGAAGC 
CAAGCATATTGCTCCCAGCAGATGCCGAGTGACTCAATCTGTCCTC 
TCGGTTCTGAAGGGAACTGAAGAACAACATGGTAAAATAAAGCAAA 
CAGCACATTTATTGGTTGATAAAATGCTGTTTTAGTCTACCCTGGC 
ATTATATGGTGATTGCTATGTGGCGAACATCTGTTATTAAATCCAG 
ACTTCTGTTGCCTGGATACATTGAGTCAAAAGCTGGAGCGGATGAG 
AAATCCATTTATGCGTCTGTTGCGTGTGAATGTCAGAGCTCATATG 
ATGCCTTTGTCTTCATTCTAACTGAATCTTTTAATATGGACCGTCT 
CACTTGTTAATTCTGACTCAGGGGCAATAATGTTTTCATTTGATTA 
AAAAAGGTTAAAGAAACAAAGAAACAGTGTTTTCTCAGGTGCTCTA 
AGTAATTCTGTTAATGAATTTTCGGAGACAGCGTGTGAATTTGAAA 
AGAGTAGGACTTTTTAAAGAGTTCATACTATGAACCCAATAATTCA 
GATCCTAGGGCCTTATCCTAAGGACATAATAGAAATGAGCACATTT 
ATAAGAACAAAGATGTTCAATGAAGTGTTACTTACAACAGCAAAAA 
AACTTGAAAGTCACCTAAATGTTTGTAAGTCAAGAGCTTCATTGAT 
ATTGACTGCAAAGTCCATGTTATTCCATGTGACGAATTTTTTAATC 
AATCACCTCTTGATGGATTTTAAATTTTTTACAATTTTTTGCTATC 
CTAAAAAAAATGTGTCAATGAACAACTTTGAACTACCCTGACTACC 
ACTTTAGGATAGATTGCTAGACGTGGA 




KCP 8579 
3 


ATCACCCCAAATAGTTATGATGAAGGTGATCTATGTACGACACTTA 
GAGAATCAGTGATGGAAAATTCACCAAGAACAGCCACAGGCAGGCC 
AGAAGAATGGCCCTGCCCCTCTACTTTTAGGATTAAGCAGAAGCTG 
GCCCTAGATCTCACCAGTTACCAGTGATCTTGGGCATTTTTAGCAT 
CATGTGCATTGCTTCACTGTGATACCATCTTGCTGGCACAGCCATG 
GAAAGCCATGAGTTAATGCATCTCCCCATGTAACAAACCTCCCCTA 
GGACTCTGGTCCACACCTATCTCTGCTAGATTCTCTGGCATTGCAA 
GAAATTCTTCAGACTGCCCCAAGAGATTCGTTCCAATCTAGGGGCT 
CCTTATCCCCAGCTCAGAGCTGGATTTGGCTCTTGCTTGGAGGCGG 
GAAGCCCTGCTGGGCCAGGGCTTAGAGGGGCTCACAAGAAATCAAA 
GCAAGCATTCTCCGCCTCTCTCCTACAGCCCTGCATGCATCTTCTC 
TGATCCCTTGCCTGAGTGGGGGGTGGCATTCCAAAAGCTCATTACT 
GGCTTACATACTTTGCCTTAAATCAGCTCTTAAATGCCCTGGGATG 
AACAGCCCTAAATAGGAAAGAAAAAAAAAAAACAAGTTTCTTGCAA 
GTTCACAGATATGCTTGGTGCTTTCTGTCAGGCTAGGGTGTAGCCT 
TCTCTGTTCTAAATTTGATTTTCTGAGTCTTTAAGGAAAAATGGCT 
ACTGGTCCCCTGGACGCTGATTGCTTCAGCATCTGAATCTGCTCCA 
TCACTTCTACCTCCACCCACTGGTCCACGTCCAGTGGGTAGAGGTA 
AAGGGGATGGAGATATCATTTATCTTCAAAGGATAAAACTGCTCTG 
AGAGATCTTTGCTTTCTTAGAAACACTGCTGGAAAGTTGTTTCTTT 
AGACTACATTAACAGAAGTACCATCTCTAGGAAGACAAGGTGGTAA 
TAACTAACATCAAATGAGCAGTTCCTATGTACCC [C/T] GTACATG 
TCTTAGCCAACTTCATCCTTGTAACAAACCTGGAAGGCAGGCACTG 
TTATCACTCTTATTCCCAGGTGAACCAGTTGAGGTTCCAAGAAGTC 
TTTTGTGCAAGGTCATGCAGAGTTGAGGCCCCCAAGTCGGTAGACT 
TCAGGAGCCAGACCCTCAACCCCCTCACTGCCTCCCGCCTCATGCT 
GCACTGAGCAGACCATACCCGGATGGTCATGTTCAGGTTGGCTATC 
AATGCAGACCACGCTGGGCATATTCAGGGGACGGATACTCAGAACT 
ATATAACATAAGGAATAGAGGAAGGACTGGAGGATGTATTAACATG 
AAGAAAAGGTAGACTCATGGCAGGAGATGAGCAGGGTAAAGAGGTG | 


SEQ ID 
NO. 156 



-147- 





CAAGACATAAAAAGCCAATTTCATATACATGAAGATTTATCAAGAG 
CCAGAAGGC CCTCTATGGGTC CAAGAGTTACAAGGCCTAATGAGGT 
GAATTAATGCCAGCATATAAGGAAAAGCTTTTGAATACTCAGAAGT 
GTCCAAAAAGGGGTCAGGCTGCCTTGGAAAGTAGTAAGCTCTCCAT 
CAGAGGCTTGGCAACTTCTTATTAGGGATGGTATGAGTATCTCAAG 
TACAGATACAGATGACCCAAATAACCACTGAGGCACTTCTGACCCC 
AAGTATAAGAGATTCTATTGTAACGCACAGGAGTCCATCTCAAGCA 
GCACACTGAGCCATCTCCTTGATAAACCTAAAGGTAGGTATTATTC 
CTCCCAGATGCTGTCTTCTTAGCCTGGGATGCAAAAGCCATAGGAT 
CACTTCACGTCCAACCCCCATCAGGTGATCTGTCATGAATCACAAG 
TTATTGGAGCCAGATGGAACTACAGAGCTAAAAGATACATGAAGAC 
ACCGAGGCCTGCAGACAGGGACTAACTTTCCAAGGTCACAGAGCTA 
ACAAGTGTCAGAGTCAGGCTAGACCCAGGACTCACAAGTTGAGCTC 
ACAATTAGTTCCACTTCCTACACCACC 




KCP 9354 
5 


CCTGAGCCTCTGCCTCCTTCTGAGAAAGACCCTTGTGATTACATCA 
GGTTCACCTGGATAATTCAGGATAATCTCTTCATCTCAAAATCCTT 
AAGTTGATCACATCTGCAAAATCTCTCTTACCATGTAAGGTAACAT 
ATTCACAGC TTCTGGGGATTAGGACATGCATCC CTAGGGAACCATG 
ATTCAACCTAGCATGGGGGAACCCACTACAGGCAGGTGTTGTCCTT 
GCCATCGCCAGCTCAGTGCTTGGCACAGTAGAGGCCATGGATATTC 
ATTCAGAGAGAGCATGCACTGAGGCAAGCCTGACCTCAAGATCAAG 
ACAGGAAATTGGCTTTCATGGGTTAAGGACCTGTTACTTTGCTCAT 
CAATGTATCCTTAATCATCAGAGGTCAGATCTGCTGGAGAGTGCAA 
TCTTTCAG [G/T] TTCCAAAAGTAAGACTGGATGCCTTAGAACTTA 
AAGTCAGGGAGGTACCCAAGAAAGCAATCATAGACTGAGTCCCCAT 
GCAGTGCACTTTCTCGGATGGACAATTTCTCTGTTCTGACAGTCAC 
TGTTGACTCCATTTCTCAGATGAGGGACCGAGGCACAGAGAGGTGC 
AGTCAGTCACCTGAGGCCACACAGTCAGGAAGTGGAAATCCATGGA 
AACTCATCATCAGCTGCCTCGCATCAGGGCCAGTGCTCTTTATCTC 
CACCCCACACATTATAAAGCCACTCAGCTTTACACTCAAGGGAACT 
TCCTATTTCCCTACTGGATTATATGTATAATTTGTAGTATTGCAAG 
ATTTGAACAGAAGCGAGCAGCAGCTTGTAGTTGTGTGTGTCACTCA 
CTCCTGCCTGTGGGGATGCCACGTGATTGTTTAAAGGGTTGGAATC 
AGGAGAAAGGCAGGCTCAGAGCAGGACCAAGAGAGAGCCCACCCCT 
CGCCTCCC 


SEQID 
NO. 157 


KCP 9784 
4 


ATTATAAGTATATACCACACTTTGTTTATCCATTCACTTGTCGATG 
GAAATTTGGGTTGCATCCACCTTTTTTTGCTATTGTGCATAATGCT 
GCTATACACATGGCTGTGCAAATATCTAATATTAGTCCCTGCTTTC 
AGTTCTTTTGGATATGTATCCAGAAGCAGAATTCTTGGATCATATG 
GTAATCCTATTTTTAATTCTTTTAGGAACTGCCATATTGTTTTCCA 
CAGCAGCTGCAGCATTTTACATTCCTACCAGCAGTGCACAAGAGTT 
CCAATTTCTCCATATCCTCACCAACACTTGTTATTTTCTGTTGCTG 


SEQ ID 
NO. 158 




TTCTCATTGTGGTTTGCTTTGCAGGTTTTGATTTGTAGATTTTCCT 
GATGATTAGTGATGGGTGCATCTTTTCATGTTCTTACTGACCTTTT 
ATATATCTTTCTTGGAGAAATGTCTGTTAACTCTACTCATACTTTT 
GTAAATAGTATTCCCAATCCTTCTAACTCCCCAATGAGGTGGATAT 
TAGTATGTTCGTGTTACAGTAAAGCCAACTAAACCTTAGAAAGACT 
AGGTT [A/T] ATTATCCAAGGTCACACAGCTAGAAAATGACACAGC 
TTGTATTGAAACATCAGTTTTTCTCTTTCCAAACCTAACGCACATT 
TCATGAAACCTACATTATTGCACCATAACATCATGTTGATTTACTT 
ATCTGCTCTCCTGCCTGTCCCATCTACTACATAAATTGAGTGTGGT 
TTGAAATCAGAGACTACTTCTCATCTTTGGCACAGTGGCAGCCATG 
GATCAGAATCTCTTACATGCTGGATAAGTGGATGCAAGCTCAAGGC 
CACACCTAAAGTCCCCAGGTGACTTGATCACTTGAGTTAGCTGCTG 
GAAACCTGGGCTTCCTCTTCTGCAAAATGGGGAGAGAAAATAAATT 
CTCAGTGGATTGTTTAGAAGATTTGAGCAAAGACCTCTGCAAAGTG 





2345.2049-005 



-148- 





CTAAGCATGTGGCTAGCATGTGGCAGGTGCTGCCTAAATAGTAGAA 
ATTAACACTGCCATGCTTATAAGCTCCGGACAAACACAAGAAGCCC 
GAAACATAATCTGTGCCTTCTGCTTGCATTCCTCCTAGTTGGGGAT 
GTAAAATAGCCCAGCTACAATCAAAGAAGAAAATCAAAGTCAGCAC 
AGACTATGGATATGCTTCTATATGTGTAGATTATTTCCAGACTCAT 
TCGGAAGAATCTGGACATACTGGTTGCCTCAGAGGTCAAGAAAATT 
GGCTCATTTACTTCTGTAACTTAATTTCGACTCTCTATGCTTTTAC 
ATAGTTGGAATTTGCCATGCACATATACTACATTTAAAAGAGCGTG 
TACGCG 




KCP_1028 
82 


CACAATTATGCTGTAGGTGAGTTTTACCTTGGGAAACCAAGGCACA 
GAATTTAAGTAACATATTGAAGCTCATGCAGCTGCTAACAGGGAAG 
GCCAGGGTCTGAACCCAGCTGATCCGGCTCCAGCATCCGAGCTCTG 
AACCACTGGTCTATCCTGCCTCTGTTAGGACTTGGTCCAATGTCAT 
CATCCTAGAAGGAACATTTAGGCCCGCACGGTGGGTGGCTGGTTCA 
ATCCAGTTTAAAGGCCAGGAGCAGGACAGTGACTTGCAGCTGCAGC 
AATCCTATGACTCAAACCAAAGCAGCTGTGACAAATAAAGGGACTG 
ACTCTCATTCTCCCGTGCTAGGGAAGGATGAGCTATCAGGCCTTGT 
TGCAGGCTGAGTCAGTCATCCCACAAACCACCTAAGTGAAACCTCT 
TCACTGAGCCTTATTTCCTGAGCGCTCTCCCTTTATCTGTGCTTGC 
AAAGAGG [C/T] GTCTCCCTCCATGCCAGCCAACCCACCCACCCCC 
GCACACACATACCACCTCTGGCTGGAACTGACGACCATGGGTTTTA 
GAAATGAGATAAATCTGGGAGATGAATGTATTCATGAGCCCATAAA 
GGGGTCATGAATCACTGGCCCCAATTACTGCCTTCAATCCTGACAG 
GATGAATTCCCTCAAGCAGATTCTCCTTGTCAGACAACACGGGAGG 
CAGTGTCATGGCTGATCTAGAGCCACAGATAACATCATTATTCCAT 
ACCAGGCTGGTTTCGGTTTCCCAAGCCACCTCCACTTGATTTACAG 
CTCACTTCTGATGCTGGAGAGAGAGATAAATATATATATATATATA 
TATATATATATATATATATATATATATATGAAAGAAAGAAAGAAAA 
GAGAGAGAGAGAAAGACACAAAGGGGAAGCTTTCATGCC 


SEQ ID 
NO. 159 


KCP 1073 
80 


ATCCCAATAGGACACATGTTGTATTAAAAAGCCATGCGAGACGGAA 
GAAGGAAATTGAATGAAATTTGAGGGCAGGTAGGAGCAGAGACAAT 
AAATAATTCAGCAGTGAAGGAAGCAGAAAAAAGATTGCACTCATTT 
CGCCCTTCAACAATTATACTAAACACCTGCTCTGGGCCACAGAAGG 
GCCAGATCCCATTCCTGTGCTCAGGAAGCCCACAGGCCGGCAGGGA 
GAGGCTGGTTGGAATGTGTGCTTTGCACTGTAACGGAGGCATCGAG 
CATGGTAAGGGACTGGCGGTGGCTGCTGCCTGCGGACGTCGAGCAG 
GGGCCTTTGAAGAGGCAGGACCTGTCTGGAGTCTTACCTGGGCCTT 
GGCCCTGGCAATGGGGAATGGAGCAGGCAGCAGGGGACAGATGCTG 
CCAGA [A/G] ACCGAGATGGTGCCGGAGGACTGGGCTGAGTCTGGG 
TCAAATGACACCGCCCCAGGCTCTCTGCCCTCTGGGGTGAGGCAGG 
AGGCTGCCTCTGTGTGTGATTCAGAGACCCTAGAATCCCAGTGGCC 
ATCACCCCACAGCACATGCCAACCTTTCTGTGATAACTTTCTCTTG 
TGGAACTGTGAAAGTGTAAGACCAGCTCCTGTATAGTGCATGGCCA 
TCCTTTGCTTTGGGGACAGTAAGTCAGTCAACACATACTTATAAAT 
GGGGTCCTGGGCCGTGGCACTGATCTGGTCCTCCCACCTTGCCTCA 
CACTGCCCTTCCCACTCACCACTTCCCTCCTCTGCATCTTAGCCGC 
AAGGGACTTTCAGACCAAGCAGACCTGGAATCAAATCCCACCGCTG 
GGCCTCAATGCCAGTGGAGACAGGAACAGCTGATCCCTGGAGCCCT 
CAGGAGGAAGAGGACGGGATGCCTGGC 


SEQ ID 
NO. 160 


KCP 1087 
03 


CCCACTCACCACTTCCCTCCTCTGCATCTTAGCCGCAAGGGACTTT \ 

CAGACCAAGCAGACCTGGAATCAAATCCCACCGCTGGGCCTCAATG 

CCAGTGGAGACAGGAACAGCTGATCCCTGGAGCCCTCAGGAGGAAG 

AGGACGGGATGCCTGGCTTGGCTGCTGGTCTGGGGCAGGTGCCCAG 

TTACAGCAGTTGGAAAAATC C TCAGTGTTGGAAGGAAATTTGGAAG 

TGAGCATCTACCTGCCTGCCGTGCAGTTTGTGACTTTTAAGATGGT 

TGACAGAACATTCCCAAAGGACCACAGCGGTGACCACTGTTCTCGT 

TTCCCTTTGGTGGCTCACTCACTCAGTGCTGGACACAGTGGTCCTG 


SEQ ID 
NO. 161 



2345.2049-005 



-149- 





ACAAGACAGTGCTGTGGCTTCCATGAACCTAGGACAGGGATAGACT 

CAAGGACTAAGAACAAACCAGGAAGAAGCATCACCACAGGCTCCTT 

GCCAGTCACCTCATCTCACCCTCCTGGCCCTGGCGGATGGGTCTCC 

ATATTTACAGGGGCCAGATGAAAAAACCAGAGGAGCCAGGAAAAGG 

AGCTTCCCCTTCCCAAGGGCGCAAGGTGAGGTGCCAGTCATGAGAT 

GCAAGCCCTGAGCTTTCTGATTCCACTGCATGTGGTCCCAAGGTTC 

GGCGCCGCATCACACAGTTAGTGAGCACACTCTCCTCCCCTGGCCC 

CGAGTGAGCCAGCTGGATGGCAGATCAGAAAGAGAAGTCCCGGGTG 

CCCCCAACATGGCTAGCTCCTTCCAGGACCAGGGGCTAGGCCCCAG 

CTAAGGCTGGTGCACACAGCAGGGCAGGGGGCGAAGGAGTGGGATC 

CCACCCAGGGATCCCACCCACCCCAAACCTGCTTTCGGACATCTTT 

CCAATGCATAATGTGCAGATGAGGCCCTTTGATAAGGACCAAATCC 

CTTTCCGTTGCTTGGCAACCTGGCTCACAAGTCATAGCAGGGAAGT 

AATTTACAGGAATTCAAAGTGTCGCTGGAGGTTC [G/T] GCTGAGC 

TGAATTGCTGCAAAGAGGAACCTCAATGGTCCAAATCACACCTCTG 

GCGGGGAGGAGGGGCTGAAGGAAAAGCTTCCACTTCCGTCACTTGA 

GAGTACAGAGCCCTGAGCTCAGACTCAGCGATCGTTTTCCATTAAC 

GGATTTACTGGTTCCATGTTGAGCTCCTGCTGTGTGGCAGGCCCTG 

TGCTGGGAGCCAGGGACACAGTGACAAACGAGACAGATGCCAACCC 

CGGATGCACAGAGCTCAAAGAGACAGAGGAGTAAACAGGGCTACAC 

ATGTGACAAGATAGGCTGTGCACAGGGGTCTGAGCAGGACCCTTGG 

GGCAGGAGGAGGCAGTGGAGGGATGGGAGGGTAGGGACGCAGTGGT 

GACCAGCTAGCCAGATAGAGAACAGAGGGTGTCCCAGCACAGGGCC 

ACACAAGCAAAGGCAGAGGTGGGGAGAGAAGAGCCTGCCACACTCT 

CAGATCAC C ATGTGGTTGGGC CAGGGCCCCAGC TGAGGCTGAGGAC 

ACATGGAGCCCAGATCCGGCAGGGCCTTGAATGCCAAGTCAGAAAG 

CATCTGAAATTTAGTCTACAGATGATGTGGGTTATTGACAGCCAGG 

ACAGGGAATGACATTTGTGTTTCAGGAAAACCACTGTCTTCACTGT 

TAGGGGGTAGATTCAGGGAGAAACAGGAGGTGGAGGGGAAGAAACT 

GTGAGTAAAGGAGTCTCTGGGGTACAGGTGAAGTTTCTGTGAAACT 

GGAGAAGAAAACTGTTGAGGCAAGAGTTGACAAAACTTGAAGTAGG 

ATGGAGAGGAAGGGACAAGTTCCCCTGGCATGGTGACGGCCCGGTG 

GTGGGAACCAGGGAAGAGGAGGGGCTTTGCAGGTGTCTGACTTGCC 

CAACAGGTGGCGCCATTTACCAAGATGGGAAGGGCCGGGGAGAAGG 

GAGGGTTCCATTCTAGGGAAATCTCAGGTCCTCGCTATTAGGATTC 

TTTCGGTTGCCAGTGACTGAAACCCAG 




KCP 1248 
77 


ACTGTCTAGATCTGGGGACCCTCCCAAGCTCTCAGAGCTTTGGAAG 
GAAGGTCCCTGCAGGGAAACTGTGTGTGTTTCTTCAACAGTGTATC 
CTCAGTGCCTAGCACATGGTAAGTGTTCCATAAACAGCTGTTGAAG 
AGACGGATGGATAACTGAATGAATGGATGCTTCCATGGGCAATGAC 
ACACTAATCTGAAAAGCCCTGTATCAATGAAAGAATCACTTAATAG 
TTTAACTTTTCCCTCATCCTTCAGAACACAGATGGCATGCCATCTT 
CCCTTCAAATCTCTTCCCAGTGCCCCACACAGAAGAGGCACACTTG 
GACACTGGTGTCTGATGGACCCAAGTTCACAGCCTGTCTCTGGTCA 
TCAGGTATCATGACCTTGGGCAAGAAGCTTAACTCTCTGAGCCTCA 
GTTTCCCCTTCTGTCCCCCAGGGAAAATGAGTCCTGCCCCTCCTAA 
GGGAGGTATGAGATGTAAGACCCCGAAGGACACAAAGGTT [C/T] G 
CCAGGAGCCTTCAGGTAGGAGGCAGGTAAGGAGGTCTGCTAGATTG 
GAATGAGTTTCTGGAAGGCCCCAAGGAGCTCAAAATCAGACCTGGG 
GTGAAGGTGTCTTGACCAAAATGAGACCCATCAAAGAAGCCTGGAT 
GAAGGTGCCCACAGCATCCATCAGTGCCAAAAACAGAAACACTTTA 
GCCCAGGATACAAGGAACATTTTAAAGCAACAGAGATAAGAGATAG 
TTAGAACTCAGGCCTCCTGGCTCTTGCTGTTCTTGGCCCATAATTA 
GTTGTTATGGGACCTTAATAAACTTCTTGCCTTCTTGGTACCTTTG 
CCAAACAATCTGATGAGGAGAATATTGAGTCATGGTGCCAGGGAAA 
ATTAGCATATTCTGCAAATTCCTGGCACTGTTAACACTGGATTCTG 
TCCACCTTTAGAAATCCTCAGATCACTATGTCAGCATCCCCCAATC 


SEQ ID 
NO. 162 



-150- 





ACAGCTCTCCAACTTCAAGGAGGGTTGAGGGGTCTGAAG 




KCP_1260 
86 


AAGAATATCAGTTCCACTTCCCTTGTCCCTAGAGAGCCTTGTAGTG 
GATGTTGATGTGTCTTCCAACACATGCACCAACCTTTCCCTGTCCT 
GTAGCAGTTGAGATGGAATCATCCCACTCCCAGCTCCAGGAATAGG 
CTCTGATGGGCTTGAACCCAGCAGCTTAATTCCATTGGTTCTCTAG 
GCCTTCATCATTAGTACAGGAAAGGCACTTGACCTAAATTAGTTCG 
ATAAGATTTAAGCTCAGAAATCTGGTTTGTTGGATGGAGAAAGAGA 
TGCTTTCTTTCTCTCTGGAAGGAGTTTATTGCAAAAGTAAGGGCTG 
GGGCTGCTACAGCCATTGTGCTACCATGAGGGAACTAGCCATGATA 
ACAAAACTTGCCTGGGGAGGGGCTACGCATCACAGAAAATGATGCC 
AAAGTCCTGCTCAAACTGTGCCTGATGCCTGCCTGATCTATGGACT 
TCTTAGTTCCATGTAATGGATTCTCTCTATTTTTAAAGCC [A/G] T 
ATCAGGTTGAATTTTTGGAGAAATAAAACAAAAAGCATCTTGACTA 
ATTTAAAAAATCTTCTTTGGGTATTCAACCCTCCTAAACTCACCCC 
CAAATCCACTGGGAGCATGTCAAGATTTTTGTGAGCCGATTTAGGA 
GATGCAAATTCATTTGCCTTAATTGGATCTCCAGGAAATGACTTCT 
GCCCCCTCTTAAATCATTTAAAGCTCAAAGAGGCATGAGGGCCCTC 
CCCAAGGATGCAGGTATCCTCTTGACTGACAGCCTGTATGCTCTGC 
TTCCAGGATCCTTCCATCTCCTCCCTTTACTGAGGGAGTCTGCTAT 
GTGTTAGAGGTGTCCATCACTGGTCACACTGGGAAGCTGTGGCAGG 
GAAGCTGGAGAAAAAGCAAGATAGGCCCCAGAAAGAACACCAACTC 
CAGACTCAGGGAGACTCAGGCCAGAATCCTAGCTCAACTTCTTCCA 
AGCTCCCAAAGTCACACTCTTTTCTCTGAGCCTCGATTT 


SEQID 
NO. 163 


KCP 1262 
08 


ATCCCACTCCCAGCTCCAGGAATAGGCTCTGATGGGCTTGAACCCA 
GCAGCTTAATTCCATTGGTTCTCTAGGCCTTCATCATTAGTACAGG 
AAAGGCACTTGACCTAAATTAGTTCGATAAGATTTAAGCTCAGAAA 
TCTGGTTTGTTGGATGGAGAAAGAGATGCTTTCTTTCTCTCTGGAA 
GGAGTTTATTGCAAAAGTAAGGGCTGGGGCTGCTACAGCCATTGTG 
CTACCATGAGGGAACTAGCCATGATAACAAAACTTGCCTGGGGAGG 
GGCTACGCATCACAGAAAATGATGCCAAAGTCCTGCTCAAACTGTG 
CCTGATGCCTGCCTGATCTATGGACTTCTTAGTTCCATGTAATGGA 
TTCTCTCTATTTTTAAAGCCGTATCAGGTTGAATTTTTGGAGAAAT 
AAAACAAAAAGCATCTTGACTAATTTAAAAAATCTTCTTTGGGTAT 
TCAACCCTCCTAAACTCACCCCCAAATCCACTGGGAGCATGTCAAG 
ATTT [T/C] TGTGAGCCGATTTAGGAGATGCAAATTCATTTGCCTT 
AATTGGATCTCCAGGAAATGACTTCTGCCCCCTCTTAAATCATTTA 
AAGCTCAAAGAGGCATGAGGGCCCTCCCCAAGGATGCAGGTATCCT 
CTTGACTGACAGCCTGTATGCTCTGCTTCCAGGATCCTTCCATCTC 
CTCCCTTTACTGAGGGAGTCTGCTATGTGTTAGAGGTGTCCATCAC 
TGGTCACACTGGGAAGCTGTGGCAGGGAAGCTGGAGAAAAAGCAAG 
ATAGGCCCCAGAAAGAACACCAACTCCAGACTCAGGGAGACTCAGG 
CCAGAATCCTAGCTCAACTTCTTCCAAGCTCCCAAAGTCACACTCT 
TTTCTCTGAGCCTCGATTTTCCCATCTGCAAAATGGGGATACTAAG 
GGTCACCTAGCTGGGCTGCCCTGGAGATTCCAAGACATTA 


SEQ ID 
NO. 164 


KCP 1290 
93 


GGGTCCTAACAGGCCACAGACCCATCCGTGGCCCAGGGGATTGGCG 
ACCCCTGTCTTTTTTTTTTTCTTTTTTTTGAGATGGAGTTTCGCTC 
TTGTTGCCCAGGCTGGAGTGCAATGGCACGATCTCGACTCTTCAAC 
CTCCGCCTCCTGGGTTCAAGCCATTCTCCTCCCTCAGCCTCCCAAG 
TAGCTGGGATTACAGGCACCCGCCACCATACCTGGCTAATTTTTGT 
ATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGTCAGGCTGGTCTT 
GAACTCCCGACCTCAAGTGATCCGCCCACCTCAGCCTCCCAAAGTG 
CTGGGATTACAGGCGTGAGCCACCACGACCTGCCCGGGGACCCCTG 
TCTTAAACCACCCCAGCCTGTGATACTTTGTTATGGTGACCCTAAG 
AGGCAAATACACCCTCCTTTCCCCAACCTCTCCCCTCAGACGAAAC 
CGATGCGAAAAGTGCTTCATGAAGTTTCAGGTAAAGAAGT [C/G] T 
GGGACGAAAAGGGATAGTGAGGATGGCGGGAGGGGCTGAACTCCAA 
ATGGGCTTATCAAGGCTCTGCAAAATGGCGTGACGGCGCTGCCCCC | 


SEQ ID 
NO. 165 



2345.2049-005 



-151- 





TTCTGGTGGCCTGAAGACTAACGCACATGATGTCAAGTGCGGGGCC 
CAAGTACTCAGGAAAAGGTTCTCATTTGGACACTGGGAGGTCTTAC 
ATTGGGGGCCCTGAGCCTCCAGCCCTTCCAAATCTATTCTCAGCAG 
GAGCTCAGCCACACCTGTGTCCCAGAACTGAGGCCAGGCCCAGCCT 
TCACTCCACGCCCAGCCAGCCCCAAGGAACCGACTCCCTGAGGCTC 
TATGCTCCCTGCCTCCAGTGGCCCCGTGTCTGGGAAATAGTGGCCC 
TGGCCTGATGCCCTGACCTGGGCAATCCATCCCCTGGTCCTCTCAG 
CTCCCGGGCCCAGGTTTTCTGGGCTACTTTAACCAGGGCAAACTCA 
TTCCTCGAGTACAAAATAAAAGATTCGAACAGCATAATC 




KCP 1291 
27 


GGTTAGTGGGATGCAGCGCGAGGCTAAGGAGTGTCTGGGGCCACCA 

GAAGCCAGGGAAGCCTAGGAAGGGTTTTCCTAGAGCCTTTGGAGGG 

AGCACAGCCCTGCTGACACCCTGACTTCAGACTCCCAGCCTCCAGA 

GCTGGGAAGGGATAAGTAGCTGTTGCTTTAAACCAGTGGTCCCCAA 

CCCTTTTGGCACCAGAAACCGGTTTTGGTTCAGTGGAAGACAATTT 

TTCCACGGACAGGGTGTGTGGGGTGGGAGATGGTTTCAGGATGAAA 

CTGTTCCGCCTCTGATCATCAGGCATTAGCATTAGTTAGATTCTCA 

TAAGGAGTGAGCAACCTAGATCCTTCGCATGCGCAGTTCGCAATAG 

GGTTCATGCTCCTATGAGAACCTAATGCGGCGGCTGATCTGACAGG 

AGCGGAGCTCAGGCGGTAATGCTTGCTCGCCAGCTCACCTGCTGTG 

CAGCCGGGGTCCTAACAGGCCACAGACCCATCCGTGGCCCAGGGGA 

TTGGCGACCCCTGTCTTTTTTTTTTTCTTTTTTTTGAGATGGAGTT 

TCGCTCTTGTTGCCCAGGCTGGAGTGCAATGGCACGATCTCGACTC 

TTCAACCTCCGCCTCCTGGGTTCAAGCCATTCTCCTCCCTCAGCCT 

CCCAAGTAGCTGGGATTACAGGCACCCGCCACCATACCTGGCTAAT 

TTTTGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGTCAGGCT 

GGTCTTGAACTCCCGACCTCAAGTGATCCGCCCACCTCAGCCTCCC 

AAAGTGCTGGGATTACAGGCGTGAGCCACCACGACCTGCCCGGGGA 

CCCCTGTCTTAAACCACCCCAGCCTGTGATACTTTGTTATGGTGAC 

CCTAAGAGGCAAATACACCCTCCTTTCCCCAACCTCTCCCCTCAGA 

CGAAACCGATGCGAAAAGTGCTTCATGAAGTTTCAGGTAAAGAAGT 

CTGGGACGAAAAGGGATAGTGAGGATGGCGGGAG [A/G] GGCTGAA 

CTCCAAATGGGCTTATCAAGGCTCTGCAAAATGGCGTGACGGCGCT 

GCCCCCTTCTGGTGGCCTGAAGACTAACGCACATGATGTCAAGTGC 

GGGGCCCAAGTACTCAGGAAAAGGTTCTCATTTGGACACTGGGAGG 

TCTTACATTGGGGGCCCTGAGCCTCCAGCCCTTCCAAATCTATTCT 

CAGCAGGAGCTCAGCCACACCTGTGTCCCAGAACTGAGGCCAGGCC 

CAGCCTTCACTCCACGCCCAGCCAGCCCCAAGGAACCGACTCCCTG 

AGGCTCTATGCTCCCTGCCTCCAGTGGCCCCGTGTCTGGGAAATAG 

TGGCCCTGGCCTGATGCCCTGACCTGGGCAATCCATCCCCTGGTCC 

TCTCAGCTCCCGGGCCCAGGTTTTCTGGGCTACTTTAACCAGGGCA 

AACTCATTCCTCGAGTACAAAATAAAAGATTCGAACAGCATAATCA 

AATAGGTCATACCCATAAATCAACACATTTGAGCACCTATTTTGTT 

GTTCTTTCACTAATCCAAACCATATTTATTGAGCATCTACTATGTG 

CCATTCTCCAGTAGCCATTCTAGGTGCAGGGGATACAGCAGAGACC 

TTGAAAAAAGGAACAGTCTCTGATCTTGCTGAGCTTAGAGTCAAGT 

GGAGGTGAGGAGGAAGGAAATGAATTAACAACTAAGTGAAGCAGAA 

GGTAACCAATTGATTGACTGACGAAGGGGTACAAACAACAAACACC 

TTCCTTTCTCCAAACTCTATCTTTAACTGTATTCTCTCGTTTTCCT 

TCCTCTCCATTTTACAATCATTTTACAACATCTCTGGCTATTCTCC 

TATATTTCTGATCACTTCGGTTCTCATCACAATAATAATTTCAGTT 

TTCAAGCATTGGAAAGTCCCATCCAATTAAAATGTCAATCTCACAC 

GCAGTTTAAACGTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGT 

GCCTCAGTTCTTGTGTGGAGGGGAGGA 


SEQ ID 
NO. 166 


KCP 1296 
90 


TGGTGGCCTGAAGACTAACGCACATGATGTCAAGTGCGGGGCCCAA 
GTACTCAGGAAAAGGTTCTCATTTGGACACTGGGAGGTCTTACATT 
GGGGGCCCTGAGCCTCCAGCCCTTCCAAATCTATTCTCAGCAGGAG 
CTCAGCCACACCTGTGTCCCAGAACTGAGGCCAGGCCCAGCCTTCA 


SEQ ID 
NO. 167 



-152- 



CTCCACGCCCAGCCAGCCCCAAGGAACCGACTCCCTGAGGCTCTAT " " 

GCTCCCTGCCTCCAGTGGCCCCGTGTCTGGGAAATAGTGGCCCTGG 

CCTGATGCCCTGACCTGGGCAATCCATCCCCTGGTCCTCTCAGCTC 

CCGGGCCCAGGTTTTCTGGGCTACTTTAACCAGGGCAAACTCATTC 

CTCGAGTACAAAATAAAAGATTCGAACAGCATAATCAAATAGGTCA 

TACCCATAAATCAACACATTTGAGCACCTATTTTGTTGTTCTTTCA 

CTAATCCAAACCATATTTATTGAGCATCTACTATGTGCCA [G/T] T 

CTCCAGTAGCCATTCTAGGTGCAGGGGATACAGCAGAGACCTTGAA 

AAAAGGAACAGTCTCTGATCTTGCTGAGCTTAGAGTCAAGTGGAGG 

TGAGGAGGAAGGAAATGAATTAACAACTAAGTGAAGCAGAAGGTAA 

CCAATTGATTGACTGACGAAGGGGTACAAACAACAAACACCTTCCT 

TTCTCCAAACTCTATCTTTAACTGTATTCTCTCGTTTTCCTTCCTC 

TCCATTTTACAATCATTTTACAACATCTCTGGCTATTCTCCTATAT 

TTCTGATCACTTCGGTTCTCATCACAATAATAATTTCAGTTTTCAA 

GCATTGGAAAGTCCCATCCAATTAAAATGTCAATCTCACACGCAGT 

TTAAACGTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGTGCCTC 

AGTTCTTGTGTGGAGGGGAGGAGAGGAGAGGGGAGGGGAGGAGAGG 

AAAGGAGACCGGGGAGGTGGGGGGGGAGAGGGGAGGGGA 

KC P_l 3 0 3 CTATCTTTAACTGTATTCTCTCGTTTTCCTTCCTCTCCATTTTACA SEQ ID 
09 ATCATTTTACAACATCTCTGGCTATTCTCCTATATTTCTGATCACT NO. 168 

TCGGTTCTCATCACAATAATAATTTCAGTTTTCAAGCATTGGAAAG 
TCCCATCCAATTAAAATGTCAATCTCACACGCAGTTTAAACGTTTC 
GCCTGCCCGTGAGCTCAGACCTGTCTTGGTGCCTCAGTTCTTGTGT 
GGAGGGGAGGAGAGGAGAGGGGAGGGGAGGAGAGGAAAGGAGACCG 
GGGAGGTGGGGGGGGAGAGGGGAGGGGAGGAGAGGGGAGGGGAGTG 
GGGGAGAAGGGGAGAAAAGCGCAGCTGGCTTCCTCACTCTCCTTTC 
CTTCCTCACCATCCTTACCCTGGCCCAGGGCAGGAGGAGGATTGGC 
AGAGTAGA [A/G] GCAGGGTCTTCTGTCTTAGCTGGGCCTGTTGGT 
GACTTTCTGTTGGCCAACATGGGCTGACTGGAATGTTCTCCAGCAT 
GGCACATGGTCATCCAGATGCAGGCTCTTCCCTGGGGCACTATAGC 
AGAGAGGGCTCTCTTCCAGTCTATTGCAGATGGATGCCCTCGTGAG 
CTGAGTTTTGATGAACATCCCATGTCCCCAGCCACCCCATTCAGAG 
CCTCTTTCTACTCTGGTCCTCTGGTCCCAGCAGCAGCCCTCTGGGT 
ACTGAGGGGAGGGCATCTCACCCAAGCCCCTTAAACCTGCTCACCT 
TCTTCAGAGCCCACGTGGCCGCAGGAAAGTCACAAACCCTTGTGCT 
CCCACAGGGCACACGTGTGCACACGTGTGCAGCTACCTTCTCTCTA 
GTTGGTACCTGAGGCTGCCTCCTGGATTTTCCAGTCTCTGTGTTCC 

CAGACAACCCCAAGCCCCAAGAATACAA 

KCP_13 05 AGTTTAAACGTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGTGC SEQ ID 
57 CTCAGTTCTTGTGTGGAGGGGAGGAGAGGAGAGGGGAGGGGAGGAG NO. 169 

AGGAAAGGAGACCGGGGAGGTGGGGGGGGAGAGGGGAGGGGAGGAG 
AGGGGAGGGGAGTGGGGGAGAAGGGGAGAAAAGCGCAGCTGGCTTC 
CTCACTCTCCTTTCCTTCCTCACCATCCTTACCCTGGCCCAGGGCA 
GGAGGAGGATTGGCAGAGTAGAGGCAGGGTCTTCTGTCTTAGCTGG 
GCCTGTTGGTGACTTTCTGTTGGCCAACATGGGCTGACTGGAATGT 
TCTCCAGCATGGCACATGGTCATCCAGATGCAGGCTCTTCCCTGGG ; 
GCACTATAGCAGAGAGGGCTCTCTTCCAGTCTATTGCAGATGGATG 
CCCTCGTGAGCTGAGTTTTGATGAACATCCCATGTCCCCAGCCACC 
CCATTCAGAGCCTCTTTCTACTCTGGTCCTCTGGTCCCAG [C/G] A 
GCAGCCCTCTGGGTACTGAGGGGAGGGCATCTCACCCAAGCCCCTT 
AAACCTGCTCACCTTCTTCAGAGCCCACGTGGCCGCAGGAAAGTCA 
CAAACCCTTGTGCTCCCACAGGGCACACGTGTGCACACGTGTGCAG 
CTACCTTCTCTCTAGTTGGTACCTGAGGCTGCCTCCTGGATTTTCC 
AGTCTCTGTGTTCCCAGACAACCCCAAGCCCCAAGAATACAAGAGC 
TCTGTCACCAAGCATCGGGCCTGTGGCTGCACTACACGTCTGCAGC 
TCAGGACCCCTGGCTGCGGCGTAAGCTACCAGCATCCCCTTCTCAT 
GGGCACCCTCATCTCCGGCTCCCCATCGCTGGGCTGTGACCTGCGG 



-153- 





GGGCGCCCCTCTATGGAAGGGAAGGAGAAAAATTCACAGTGCTATC 
TACTCCTCTGAATGCACTCCCACCAATTTCCTTGGAAATTTCTAGC 
TTTCACTGACATATCTGGGATGGGGCGGTGGTCACAAAA 




KCP_1312 
44 


GTCTCTGTGTTCCCAGACAACCCCAAGCCCCAAGAATACAAGAGCT 
CTGTCACCAAGCATCGGGCCTGTGGCTGCACTACACGTCTGCAGCT 
CAGGACCCCTGGCTGCGGCGTAAGCTACCAGCATCCCCTTCTCATG 
GGCACCCTCATCTCCGGCTCCCCATCGCTGGGCTGTGACCTGCGGG 
GGCGCCCCTCTATGGAAGGGAAGGAGAAAAATTCACAGTGCTATCT 
ACTCCTCTGAATGCACTCCCACCAATTTCCTTGGAAATTTCTAGCT 
TTCACTGACATATCTGGGATGGGGCGGTGGTCACAAAATCAATCCC 
ACTTTCCCTCGGCTAGTCTTACAAGCACCCAACAGCTCTATTCAGA 
ATACAGGGCTGCCCAGCTACTTCCCATTCATTATCCCCAGGTTGCA 
AGCTTTAGTCAAAACCCAGAGGCAGCAGGGTGTCTGGTTCCACCTG 
CTGTTAGGATGATTTCAGGAGTGCAAAGTGTTAGAAACGC [A/G] G 
TAAAACATGATGCTTAGAGATTAAGTGGGATGGGGACTGGGCAGAT 
GATGCTGCTTTGGACCCAGCGAGTGAGGTGAGACTGCGACAAGACA 
GAGCCACTGAGCAGTGACCTGGGGGATGGGCATTGCAGGCAAGGCA 
GAACCCCAAGTGGGAACAACCTCACTGGGCTTAGCAAAACTAAAGA 
GGCCCAAAGTATACTGAGCGATGAGGTGAGTGGCGTGGGATAAGGT 
TGGAGAGGAGGCTGGAACCAGACCCTGCAGGGCCTTGCAGGTGATG 
GGAAGGAGTTTGGAAGGTGCTGGAAGGTTTGAAGCAGAGGAGGGAT 
ATGATCATGCCTGTAGCTGCTATGTAGAACAACTGTATGCATGCCA 
GGCCTGTGCCACGCATGCTCTAATCATTACTGGCTTTAACCCTTGC 
ACTAACGTTGTCATGCAGGTAGGAGCATCTGCACCCAGCAAATGGA 
AACTGAAGCTCAGGAATATTCAGTCACTTGTCCAAGGCT 


SEQ ID 
NO. 170 


KCP 1318 
54 


ACCTGGGGGATGGGCATTGCAGGCAAGGCAGAACCCCAAGTGGGAA 

CAACCTCACTGGGCTTAGCAAAACTAAAGAGGCCCAAAGTATACTG 

AGCGATGAGGTGAGTGGCGTGGGATAAGGTTGGAGAGGAGGCTGGA 

ACCAGACCCTGCAGGGCCTTGCAGGTGATGGGAAGGAGTTTGGAAG 

GTGCTGGAAGGTTTGAAGCAGAGGAGGGATATGATCATGCCTGTAG 

CTGCTATGTAGAACAACTGTATGCATGCCAGGCCTGTGCCACGCAT 

GCTCTAATCATTACTGGCTTTAACCCTTGCACTAACGTTGTCATGC 

AGGTAGGAGCATCTGCACCCAGCAAATGGAAACTGAAGCTCAGGAA 

TATTCAGTCACTTGTCCAAGGCTCCCCAGCTGTTAGGTGCTAAGGC 

TGGATTCAATCCAGGACTTGCAGACTCCAGTATCTTGGCTTTTCTA 

ACGAGAGTGTGCTAGCTTTCTAATGGGGGTGGGGAAGGCA [G/T] T 

CTGCCCCCCTCCCATGGCACCGTGAGCAGGTGTCACTGCTCCAGCC 

AGTACGCCTGGACACCGACTAGGAAGGAGTATGTGCTACTAGGAGG \ 

GATGGTCTGGGCTGACTCTTTGAAGTTGACAAGGAGTTGCATAATC 

CCAGCTAATAATTATGCTGGACCAGGGGCAGAGACATTACTCCAAG 

GGTGACCAGGTGTGGAGAAGAGGCTGCTGACTCCGGGGCCCCAGGA 

CCTGGCCCCCAGGTCTCATTGCCCGAGTGCTGCCCCAGAAGGAGTA 

GAAGCTGGAGCTGTCCGGGCCACAGCCGAGGCTGGGTGAATGCTGC 

AGTGAGGCTGCCGCACAAGTTGCGTGTTGTGACATTTGTCTTCTGG 

AGGGGATTGGGATGGGCTACTTCAGCATTTAAAAACCCCTACTAGG 

TCTGAGAAATCCCCTCAGCTTATGAGCCTGGGTGGGCAGCAGGCCT 

TCTCAAGAAGCCCAGAAGGCCAGATGCTCACTTCCCAGG 


SEQ ID 
NO. 171 


KCP_1326 
77 


CAGTGAGGCTGCCGCACAAGTTGCGTGTTGTGACATTTGTCTTCTG 
GAGGGGATTGGGATGGGCTACTTCAGCATTTAAAAACCCCTACTAG 
GTCTGAGAAATCCCCTCAGCTTATGAGCCTGGGTGGGCAGCAGGCC 
TTCTCAAGAAGCCCAGAAGGCCAGATGCTCACTTCCCAGGCTCTCT 
TGCGGCTGAGCTGAGAGCAGGCACCTGAGGCCTGGCAAGTGTGACA 
GCTGGTGACACAGACAGACAGGGACAGGGAGATGGGACTGTGCCTG 
CAGCGGTAGCCCTGGCCGGTGTTCAGTGGGGCCAGCATCCGTGTCT 
TTCCTGGGGGCCAGTGGGGGCCGTGGCTCTGACGATGCATCCCTCC 
CCCACGTTTTTTCTCTTCTTGTCTTGGACTTTGCAGGGAGCACTCT 
GCTTTTGGGAACAGGAGCTGGGTCTCTGGCCATTCTCCGCAGCCCC 


SEQ ID 
NO. 172 



2345.2049-005 



-154- 





TCACCATTCACTCAGTGGCTCTCAAAAAATAGAACCTGGG [A/G] C 
AAAGCTGTTCTTGGCCCCAAACAACATGAGGAAAAATAAATAAATA 
ATGTACCTGGTAACTGAGAGAGTTCCCTCTGCATCTTGGGCTCTTT 
CAATGAGATGTCCTCTGCCTGCAGCAAGCCCCAAGGGCTTCCCTCA 
CCAGGACCAGCACCCTGGTTTGCCTGACCCCACACCTGCCAATGCC 
GGGGCAAGAATGTCCCAGGCTGCCCTGGTTCCCAGAGCTGATGCTT 
CCCACAGTGCCCAGCTGTGCTGGCATGGAGCTAAGGACAGGGCCAG 
TCCCAAGAAAACAACAAGGCTCCAGGGCCACCGGCCACTGCTCAGG 
ACCCTGGCTGACCCCACAGATGCGGAGTGCCTGAGATGGCTCATGG 
GTGACCCCCAGGCATCTGGCAAAGGTCACAATGGCTGTTTGGCTTG 
AAGACAGCCCTTGCAAGATCTGTTTTGAGCCAACCTGTGGCATTTA 
GCCCTCCCTGGGTGACAAATAAAAAGGCTGAGGCTTGTA 




KCP_134 0 
45 


ATTGGAAGATAAGAATGCAGCCAAAGAGGTCTTCAGGGTAGCGTGT 
GGCCTGGGCGCTGAGACTATTGGGCCTAGCAACTTCTCAAGCAGTC 
TATTAACCACAGCCGGTAGCCAGCTTTTCCCCGCCCTTCTCCCAGG 
CACACACAGCCACCTCCATCACCAAAGGTCAGGCGAACCACCTCCC 
ATGGCTACCCCCAGCCTGACTTGCTTTATAGAAATCATGGCATCTC 
ATCCTCACAACAGCCCACACTCACAGTGAATCTTGGCCATTATGAC 
AACTGGGGACACTGAGGCTCGGAGTGGTGGAAATTCTCAGAATCAC 
ATAACAATAAGTGTTAAAGTCAGAATTTCAACTTCATCTCTCTAAC 
TCCAAAGGGCGTGTGTGTGTGTGTGCGTTTCTGGCCATAATCATAT 
TGTGCCCTACAAGCCCCAGTGAGGAATCTGCTAGGAACACTGGTTT 
GGGGAAAAAATGTAATAAAATATGTGATC CAGAAGGCGGC [C/T] T 
TGGTACCTGTCATAAACCGCAGCATGGGGTACTCACTATGCCTGGG 
GTCTGGGCTCTGAAGGCATGATTGAATGATCTCACTGCAGGCCTGG 
TTGTCCTGCGAAGACACCCGTCAATACATGAATATTGACACACAAC 
GCTGCAGTGCACGCGCTTCTGGCAGGGGAGCTGCTGCACTCGAGGG 
CAGCTCAAGGTTAATTTGCAGGGTTCATGTTTGGAGTTTCTGAGCA 
AGTGTTGCAGCTTTGGCCCCCAGCCCCCTGAGGGGAGCTCTGGCCG 
TGCATGAGGGTCAGACAGAAAATCTCCTTTCCTCCATCCAGGCCTG 
CAGTCTGCAGCACTGAGGTCAGCGCTGGCCACAAGCCCACCCTGTG 
CCTCGTCAGCCCCACTGAGCCTCTCCATCTATCATGCCACAGGCTG 
ACCCTGAAATGCAAAATCATTCTGTCCTCCCGCCCTCCACTCCCAC 
CTCGCACATCTATGGATTTGCTGTTCAGAAAACATCTGT 


SEQID 
NO. 173 


KCP_13 55 
18 


AGTTAGATTGAGCTGGGTCCCATCTTGGGCCTTTGCTGGTCCCTCC 
CTAGAAAGTCTGCCCCCTCCCCCTGCAGGGTGGCATCAGCATTCAG 
GCCTGGCCCTGACGCCCTCCTCTCTGGGCCACCTTCACCTCCACAA 
CCCCGGCACCAGCACCCATCCCCACCACATCCCCAGCACGCAGCAT 
CTAGTAAGGGCACCAAATGCATGCCCAGACATATGAGTGAAATGAA 
TTAACCCTGAACCTGAAAAAGGGCAACCACCACACAAGATTCTCTA 
GAAACAATGTGAATTGTGCAGAAGGAAATTAACCCTACTCCATCCA 
GCCCATCCTAAGGCAGGGACTTGGACCTGTTCCTCTTGATGGGGCT 
GGGGCTGAGGCGGGCAAGGCAGGCAAGTGCTGAACAGTTGGCAACA 
TTGCCCATCCCGTCTCCCTGCACCAGGCTGGGCCTGGGGTGAGGGG 
GTGGGGGCCGGGGTAGCTGGGCTCCTCCAGCAAAGAGCAG [G/T] A 
CTGAGTCCCTGGTGACTATTAGGTAAAAGGTCCCTGACAATTTTGA 
GGGGCCAGATGCCAACTCGAGGGATACAGAGAAGATCTAGGCACAG 
TCTTTCCCCACCATGTCAGACAAAAAGGTTAGATACAGGACCTGAT 
ATGTTATAAAACTCAATCAATATTTACTTAGTGAATAAATGGACGG 
ATGGATGGATGGATGCATTAGGCAGCCAAGTGGGCAGCACCGATGA 
CTTAATGTACTGAGTGCTCCGACTCCAGCAACATGCATTCATTGTT 
CCTACTGTGTGCCAGTGAACAAGAGCAATGAACTCAATGACTTCTG 
CCCAGGGTGGGCCAGGGAACCAGGGAAGACTCTCCAAAAAGGCAGC 
ATTTGGGCTGGGACGTACAGATGAGTAGGGGGTCGAGTGTGTCGTT 
ATGTCGCTGGAGCCCAGAGGCGTCCATCAGGACTTGGGGGAGGGCA 
GATGAAAGGGCCTTACTGCCTAACTTGGAGCCACTGTAT 


SEQ ID 
NO. 174 


KCP 1360 


CCCATCTTGGGCCTTTGCTGGTCCCTCCCTAGAAAGTCTGCCCCCT 


SEQID 



2345.2049-005 



-155- 



36 


CCCCCTGCAGGGTGGCATCAGCATTCAGGCCTGGCCCTGACGCCCT 

CCTCTCTGGGCCACCTTCACCTCCACAACCCCGGCACCAGCACCCA 

TCCCCACCACATCCCCAGCACGCAGCATCTAGTAAGGGCACCAAAT 

GCATGCCCAGACATATGAGTGAAATGAATTAACCCTGAACCTGAAA 

AAGGGCAACCACCACACAAGATTCTCTAGAAACAATGTGAATTGTG 

CAGAAGGAAATTAACCCTACTCCATCCAGCCCATCCTAAGGCAGGG 

ACTTGGACCTGTTCCTCTTGATGGGGCTGGGGCTGAGGCGGGCAAG 

GCAGGCAAGTGCTGAACAGTTGGCAACATTGCCCATCCCGTCTCCC 

TGCACCAGGCTGGGCCTGGGGTGAGGGGGTGGGGGCCGGGGTAGCT 

GGGCTCCTCCAGCAAAGAGCAGGACTGAGTCCCTGGTGACTATTAG 

GTAAAAGGTCCCTGACAATTTTGAGGGGCCAGATGCCAACTCGAGG 

GATACAGAGAAGATCTAGGCACAGTCTTTCCCCACCATGTCAGACA 

AAAAGGTTAGATACAGGACCTGATATGTTATAAAACTCAATCAATA 

TTTACTTAGTGAATAAATGGACGGATGGATGGATGGATGCATTAGG 

CAGCCAAGTGGGCAGCACCGATGACTTAATGTACTGAGTGCTCCGA 

CTCCAGCAACATGCATTCATTGTTCCTACTGTGTGCCAGTGAACAA 

GAGCAATGAACTCAATGACTTCTGCCCAGGGTGGGCCAGGGAACCA 

GGGAAGACTCTCCAAAAAGGCAGCATTTGGGCTGGGACGTACAGAT 

GAGTAGGGGGTCGAGTGTGTCGTTATGTCGCTGGAGCCCAGAGGCG 

TCCATCAGGACTTGGGGGAGGGCAGATGAAAGGGCCTTACTGCCTA 

ACTTGGAGCCACTGTATGTTTCAAAACAAAGGAG [A/C] GAGAGGA 

TCCTGGGAAAGAGAAAGGGTACTCTAGGCAGAGGATGTGAATGGGC 

ACAGCACAGGTGAGAACATCAAGACCAGGGGTCAGGGAATCTACTG 

GTAAACAATTGTACCCCAAGGGAGCAATCACAGCCTCTCCATCCAC 

AGGGAAATGCCTGGTGGGGAGGAATGGGAGGAAAGAAACAGATTGC 

ATGACTGTGTCTTGAAGGTCTAATTCCAGAGTACAGCATCACCCCT 

ATCTTCCAGGTCCAGAAACTGAGGCTCAGAGGGAGACTTTCTGATG 

AGTGCAGCGTGCAGATAAGAGCATCTCCAAAGCTACCTCCTTCCCC 

AGTCACACCAGGGCATAAGCAACTGATAACAGCTGTCAGCACGGGA 

CAGTGGAGGGAACACTAGGTTAGGAATAAGGGTACGAGGCTTGAGT 

ACAGATTGTCAATGACTCAGTGTGTGAACTTGGTCAGGTGACTCCA 

ACCAGATGACTTCCTTCTCTGAGCTTCTGTTCCCTCCTCTATGAAT 

GGGGACAATCACTCAGCTTCACAAAACAATGGCTGCGAAATTGCCT 

GGTACAAGAGAGAGAACTTCCAGTGTGTAGGGGCTGTTGTCCTAAC 

TGCCCAGCCCCCTAGATAGGTAGTTATGTCATCTGTGAAATGGGTG 

TTAGAATTCCTACCTCCCAGGACAGCTGTGGGCAGAAAACCAAAGA 

ATGTGTGTGAGAGCCCAAGCACCATGCCTGGCACATAGTAGGTGCT 

CAGGAAAGGCTGAGGGTGCAGCTGCTGTCCACACACATGGTACCAC 

TGCCCCAGGAAGGGGCTTCAGGAACCAAGAGCAATTCTGAGCACTG 

GTGACTGGACTCTGCCATTCTCCATTTCAAACGCTTTTTGAAAGCA 

GCTCCAGACCCAAGCAGGAGAGCAGGAGGCAAAAGAAACGCAGGGG 

CTTTCCCGAATGGAATTTTAGAAACACACAGAATTGTCTCCTGCAC 

AGAAGGGAAGCTGTCTTCCACAGCACA 


NO. 175 


KCP 1376 
60 


CACTGGAGCTGAGACTCCCAGGTCCCCTAGGGCTTCTCTCCCAGGG 
GCCTCTGGGCTCCCCAAGGCCACGTGCTGCCCCCACTAGAGACCTG 
GGCCAGTCCTGACCAGGGGAAAGAGTAGCGCCGACAACAGCCCCAG 
ATGGTATGTGCACTGGCACATACTGGCAGCTGCCTTCATGACAGCA 
AGCCATAGGTCCAAATCCCGCCCCTTCACAGGGACATTCCCAACTG 
GTCAGGGGTGGACCTCCCCTTCCCGGCTGTCTTTGGTGTCCAGGAC 
GATTTGCCACAGACAGGGGGAGCTAAAGGGGCCCACGCTTGAGGCC 
GCTCAGCTCTGAGTCCTCGCCGGCCACAGAGGACCTTCGTGCCTGT 
CCTCTGTCCTCCTGCCCAGTCCCCAGGCCAGGCTCAGCTGGAGTTG 
GGGAGCAGAAAAACACGCATCTGAATCAAGGCTCTCGGAGCCTTTG 
CTTCTGCCTCCAAGAGGCGAGGGAAAATGAATACCCAGGC [A/G] A 
GCGAGCAAGAGAGACCCTCAGAAAACCCCAGATGCCCCTGGAATCA 
AGCCCTGTCCCACCAACGCCACGTGGATTGACAGGCTATTAGTCTT 
CCTGTAATTAGGATTCTCGCCTCAAATCTTGTATCTTTTTCCCCCA 


SEQID 
NO. 176 



2345.2049-005 



-156- 





GAAGATTCTCCTCCAGCCTTCACCACTGCCCCCTGGCGCTTCCTTG 
CAAGGCTTTTGAAGAATCCTTTGCAGAGAAGCAGCCTCCTTTGGCA 
GGGGCTGCAGAGCACTCTGCCTCCCTAGGCCAGGGCGAACCAACAG 
AGGCGGGAGATGAGGAGGAGCAGCGCGGCTCTGCTGCGTGGCCCTG 
GGCAAGCACCACAACCTCTCTGGGCCGTTTGCACATTCTTACCGCC 
AGGGATGTGGGCGGTAAATGAAAGAGACCAGCACAAACCAGTGTCA 
GCTCCCTTCCTCGATTCCTAAAATGTGATGCCCAAAGATGGGCCAG 
CCTCCTGCTGTGCCTTCTCTGGGGGGACATTTAATAAGT 




KCP 1436 
12 


TGGCCGCCTCTTCCAGATAACAACTCTCCTCTCCTTCCCTGCCCTC 
CTGCTCCTCCTGTTCGCGCTACATAACAGACTCTGTGGGGCCTTGG 
TTTATGTATTTCCTTCTCTCCCCTACTGAAATACATGTGAGCGATG 
CTGGGGCAGGCCGACTAGAAGAAGCAGACTATCTGCTTCTTCTCCA 
CCCTTAGAATGGTGCTGGGCCCAGAAGAGGCATGCAGTCGATATTT 
GCTGAATAAATGAATGTCAGATAAAGTGGTGTGGGGACTCCAGGGG 
AAAGATTTGTCATTCTCCACCCTCCCAGTTCAGCTTAAAGCAGAGA 
AGTGAGAGGTGCCCAAAAAGGGGTGTGTCTGGGGGGTGGGGGGTGG 
GGATGTTCCAAGATCTCCAAGGCCTGGATTTTAAGCAAGGTTTGAG 
ATGCCAGCAAGAGGGCCTGGCATTGCCAGATTGATAGTCTGCATTT 
CAGAGAAGGACAACCCCACCTCTGACCTTAGCCC [A/G] AGCCTCA 
ACAGCCTGCTCAAGGAGATCCACCCTTAGTAGGAGGAGGCAGCCAG 
GCCAGGTTCCAGTCCCTGCCACCGCTTGCCAGGTGTGTCTTGGGCA 
GCAGTTGCCTTTGCTCGGTGGTCTTCAGCTTTGCCCCCTGCCAGGC 
ACGTGCTGGCCTCCTGCCTGCATCGTAGCTCATGGAGTCCTCTCAG 
TCACCTCTGTATGCCCTGCAGCATCCCCAGTTCTCAGTGAGAAGAG 
TGTGCTCTGAAAGTTAAGTAACTTACCCAAGGTCACACAAGGTCTG 
AGTCTCAAATGCATACAATTTGACCCCATAGTCTAAGGTCTTGACC 
GCAATGGAATAAGAAATTATTTTACCATTCTGAGTGGCAGTCTCTG 
AAGACTACAGCAATAATTGATGCCTCTCAGGGGGATAGGTGTGTCA 
CTTACAGGTGATAGTGAGGTTGTCCTCAGCCTCCCTGCTCTTCGTT 
AGACCTCCCTCCTCCTCTCTACCCGGGCCAAGCGT 


SEQ ID 
NO. 177 


KCP 144 9 
60 


GCGGAACACCTCTGCCGCACCTGCAGCAGCCTTGCTCTATTTCTTC 
ACAAGCTTCCCCATGACACTGACCCAAGGCTGTCTGGCCACTACAG 
CTGCTGATGATGATTAGCAATAATAATAATAATAAACGAAATGCCT 
TCTGCTTAGATCATCTTTAATTTCCCCTCCAGAATGACATTCGACT 
CTGCTTAGAGTTACAGGCAGCCCAGCAATTACTGAGCGCAAATACC 
GTGTTCACCCGCCTCACCTCATCCACGCCCCCACAACACCCAGCCC 
TGAGACTGGCTCCACGATCACCTCCACTTTATAAAATAAGATATCA 
AACTCTGAACAGAACGGACGTCTCAAAAAATGGGCATATTACATTT 
AAACCCTCAATCTGTTGGGTATTTGAGTGAAATGGACATACCTCCA 
GGGAGTCGGTGGCGAGGGCCGGCTCTGAGGACTTCCTGGGTTGGGA 
TCCTGGCTCTGCAGGACTGCGTGACCTTGGTGAGTTACTT [C/T] A 
TCCCTCCAAACGCGCTGTTCTCCTTCATAGAATGGAGATGACCACA 
GGGCCAGATTCATAAGGTTGTTCCTTGTAATACAGGTGAATATCCA 
TACCCAGCAACTGCTGGACCACCTGTGGTTTCAAGGATAATTTCCC 
TCCCACGTCCCCGTGGCCCTTGGAACCTTCCTCTCCTCCTGTCTCC 
CCCTGCCCCCATCACTTTGTAATTGAAAAGTCATGATTGCTCTCCC 
AGGTGTAGCACTGCTCACAGGTCAGATTGCCTGCTCTGACGTAGTG 
ACTCAGTTGGATGCGGTTCAGCTGTGTATGATCAACTCCCTCCCCC 
TGACAAAAACATTATTTTGCATCACAGAGAAGTTGATTTCTTTCAC 
ACATAAAAGAAGGCAAAAAGTGGTGCCTAAAGGGCTGGTACAGCAG 
CTTCAAGAAATCAGGAAGAACCTGGGCTCCTTCTGCCTTCTTGTTC 
TGCCAATATCACCCCATGGCTGCCACTTCATGGCCCAAG 


SEQ ID 
NO. 178 


KCP 1467 
46 


TTGTGAGTAGGGCACGCAGGGAAGAAACCTGTTCAACCCAGCCCCG 
TGCTAGAAAGACATCAGCAGGGCCTGCAAAAGCCCTGATTAAATCT 
CACAAGTTTGCACCTGGAGCCGCCATCTTGAATTGCAGGTGAATAT 
CAGCCTTTGGTTTGGGCTGTGTGCCCCAGATGATGGTGGTCCCAAA 
TTACATAGGCCAATATCCAGAGCTGGGTTAAAATGAAGCATTTCGA 


SEQ ID 
NO. 179 



-157- 





GGAAAAAAATGCAATGAAATTTGTTTAACCGGTACTTCAGGCTTTT 

GAGCACAGAACAGCGTCCATCCCTCCAAACACACACTGAGGATATA 

CACTTAGCCAGGAGGGAACATAAGGAGGGGTGGACAAGCCATGTTT 

ACTAAAATCTCTCAGTGTGTGCCAGGCATGTTCATGTATATTCAGG 

AAGAAGTGTCAGTATTTAAGATCCTCGGCCCTTGCCCGAGTCCCCA 

ACACGCCTTCTTGTCTGGAGAACTGTAAATCTTGGAAACATCTTGC 

AAGGGGGGACACCTCACAGAAGGCAGGCTTGGCATGGGATAAACAG 

AATCGACTCCTCTGCTTCCTTCTGATGCACAGTGAATGGGCAGGTG 

GAAGCATCGTTGCTTAAAGAGGAACCAAAACTCCACCCCAGAGCTG 

CTAATTCCTTTTGGCTTGCAGTTATGCAGAGGGCTAAAAAATCCAA 

CGAATCACAAATCCCCTGGTTGCTAAGTAGAAAGAATATGTTTTGG 

CTGCTGCTGTTCCCTTCCCCAAGGAAAAGATTCAAGCAGAGGCGGT 

CCCCACCTCTCAACACAGAAAGCAACATCTCTGATTGCCTCTAGAC 

ACACCTTCATGCTCGTGGCACTTTGGGACCCTCTGCCCGCTGGCTT 

ATGGGCATGGCTTCCCCATCACTCTGGGTCCTTGGGAAGAGCCTCT 

TTCCCAGACCCCACCTCTGTGCCTCATCACATTTCTCCCAGGCTAT 

TGACTTGTTCAAGGTTAAGGTATGAAGAGAGTCA [C/T] GCAGCAG 

CCCTACCTGGCTCTGCTCTGCTGGGGGAAGCCTTTTCAGAGCCTGC 

CTCTTCCTCAGCATGAGGGGCTGCTCGGGCCCAGTCCCAGAGGCCA 

TGCTGGTCCCAGGGGAAGGTGGCCGTCATCCCCATCTGTGTTTTCT 

CTTGCAGGTAAGTCATGCTCCAGCAGTCGGGAGGGTTGTGTGATGA 

CACACTTGGCAGTTTGGGAGCAAAAGCCGCCACAGTAAGACACAAT 

TGATTCATTGCCTCTCAACCCTCTGCTGGGGTGGACTTTCATGCGT 

GGACTTCTGTCCCCAAAGAGGCTTCTCTGGGTCTGGAAAGGGCCCT 

AGCCTTGGTTGGGGGAGGCAAAGGGGTGGCGGCTTCCAGGTACCAT 

CTGGCCAGGAACCGGCTCCATTGTCTGTGCATGTAGCTTGCACTGG 

GCTGCCTGCTCCAAGGGAGGCATCTCCCCACGATCTACGACATTGG 

CTTCAAAGAGCTGCTCCTGGCAGCTTCGAATGGCTGAGACCTACTG 

GCATGGGATGGAGGAGTGCAGGGAGCTTCCCGGGACCTCGCTAGTC 

CTGCCTGGATGCTCAGAAGGCCCTCGTCCTCGGTGGCATGCAGCCT 

CGGCCATTTCCAAACTCACGGCATCTCACCCAGCCATGTCACCCAC 

CCCCGGCTCTGTCGCCCTTCCCATCACCTTTCTCCCACCCATCACC 

TCACATCAAGGTTTCAGCCAGCGGGAACCAGGTTTAGACTCCAATT 

ACCTGTGCGTGTGGGAGGTTGGATTGTGACATCTTTGGAGGGCCGG 

GCTTCTGAAGCGACATTTGATTTCTGGTACTGAAATGTCAAAGGGT 

CCTGAGGCACCCGCTAGGGCAGCACGCGGAGCATCCACCTGCGTGC 

GCATCCTGGGCTCTCTCTGGGCCACTTGGTGCTGGGGACATGCCGG 

GAGCTGGTGGTCAGCCCTCCTCCTGCCTCCTCAGTGCTGCATCTTC 

ACCTTCTGCAGCTGCCTACCAGAAGCA 




KCP 1492 
16 


ACACCTTGACTTTAGCCCAGTGCAACTGACTCCACATTTCTGGCTC 
CAGAACTGTAAGAGAATACATTTGTGTTTTGTTAAGCTAGCAAATT 
TGCAGTAATTTATGACAGCGC TATGAGAAACCAAAACAC CAGGATT 
ATGCCCCAAGGATCCTGATGCCCTCCCTCCTCTCTGCTCTGCAGTG 
TGCTGGAGCTCACAGGGCTCTGCTGCTGGGAGTTAGTATCTAGTCC 
AACACTTTACCCACTCACCCCCCAAGCTAAGGGACTCCTGAAATCA 
GGGACCAGATGCATAATAGGTGCCCAGGAAGTGAGACTCGCCTTCC 
CCAGATTAAGAATAAAGAAGACAAACTATCCACGGCTGCTGTGAGC 
CTCTCATCAGACCTCAGCTTCTAGGGCAGGGTCCCTGCCTGTCTCC 
AGTATGTGGCCTCTGTGTCTTCTTCGCCCTCCATCCCCACAGTGGG 
ACGAGAAGTCATCAGGAAGGCAGGGGATCTGCAGGCAGCC [A/G] T 
CAGGGCTCTAATTGCAGCTGGCTGGGGGACCATGGGTCAGGGCTGC 
CACCCCCTGGCTCTGTGCCTTCACCTGTGTAACGAATGGGGCACTC 
ACAGCCCCTCTCAAGTGGTCCTGGGGATGAAGTGAGAAGGTGACAT 
ATACAAGTGAGTTATACACGTTCCTGTTCTGTCACTCACCAGTGCT 
CACTGGGTGGGTCACTGAACTCCCCTCAGCGTTTCCTTCTCCATCT 
GTAAACCACCAGTGCAAACCTTTCCCAGATAGTGCTGACCCGAAGC 
AGGAACCAGTGCCCCTCTGCCCTCAGTAAGTCTGCCAGCAGAGGAA 


SEQ ID 
NO. 180 



2345.2049-005 



-158- 





GCCCATAGAGGGTCTTGGGAAATGAAGCCAACAGAGTCAAGAGGGT 
CAGATGATGAGGGACTTCAAGTGCCACCTTCATCCCATTCTTTCTG 
CAAATATTCACCACACACCTACGTGACCTCAGGCTCTGTGTCAGGT 
CCTGGGGATGTAATGGTGTCCATGAAGAAACAAGGTCCC 




KCP 1495 
35 


TCCCCAGATTAAGAATAAAGAAGACAAACTATCCACGGCTGCTGTG 
AGCCTCTCATCAGACCTCAGCTTCTAGGGCAGGGTCCCTGCCTGTC 
TCCAGTATGTGGCCTCTGTGTCTTCTTCGCCCTCCATCCCCACAGT 
GGGACGAGAAGTCATCAGGAAGGCAGGGGATCTGCAGGCAGCCATC 
AGGGCTCTAATTGCAGCTGGCTGGGGGACCATGGGTCAGGGCTGCC 
ACCCCCTGGCTCTGTGCCTTCACCTGTGTAACGAATGGGGCACTCA 
CAGCCCCTCTCAAGTGGTCCTGGGGATGAAGTGAGAAGGTGACATA 
TACAAGTGAGTTATACACGTTCCTGTTCTGTCACTCACCAGTGCTC 
ACTGGGTGGGTCACTGAACTCCCCTCAGCGTTTCCTTCTCCATCTG 
TAAACCACCAGTGCAAACCTTTCCCAGATAGTGCTGACCCGAAGCA 
GGAACCAGTGCCCCTCTGCCCTCAGTAAGTCTGCCAGCAG [A/G] G 
GAAGCCCATAGAGGGTCTTGGGAAATGAAGCCAACAGAGTCAAGAG 
GGTCAGATGATGAGGGACTTCAAGTGCCACCTTCATCCCATTCTTT 
CTGCAAATATTCACCACACACCTACGTGACCTCAGGCTCTGTGTCA 
GGTC CTGGGGATGTAATGGTGTC CATGAAGAAACAAGGTCCCTGCC 
CTCATAGAGTGGCCTGACATATGCCCGAGGCAGTCAGCAGCCGAGT 
GCGGGAGACTCTTGAGCAGAGATTGAGTGTGTTGATATCTGTAGGC 
ATCAGCCTGGCTTTGCTGAGTGAGCTATATCAGAGTGGAGGAGGCC 
AGAGGCAAAGTCCAGACTCCACTGGATCCTGGATTGAGGGGAGAAG 
GGGCTGGGCGGAGGAGCAGCCTGAGCACCTGCATCTCACTCCAACT 
GGGTGCTGATTTGTCCCCATGGCCCCAGCACCCAGGCAGGTCACCA 
AGTAAGCTCAAGACAAAAATGATGAGTGACTCAACAGTG 


SEQ ID 
NO. 181 


KCP 1567 
32 


ATAAATTGGATTTCATCAAAAATTTAAACTTCTGCTCCAAAAGACA 
CTCTTAACAAAGGGAAAAAGCAAGCCACAATATGAGAGGAAATATT 
TGCAAAGCATCTGATAAAACATGTGGATCTAAAATATGCAAGGAGA 
ATAACAACTCTATTTTCCACTAAGGAATGAATGACTGTACAAGGAC 
CACATTCTAATTAGGAGCTTCTGAACCCAAAGGAATTTCAGATAAG 
GGGAAATTT AGGC C CAAAGCC AGGAGAAGGGGTGAGTAGGGCTTGA 
TCTCTGCCTCTGAAGGGCAGAGGGCGTGGACTATTCTTGGCTCTTA 
GGGGACAGCTAGAGAAATGTGGGTCTCATGGCGACAACTCTGGACT 
CCATTGGAAGAACCTTCTAACAGTCAGGGCTCCCAGAGATAAACTA 
GACAAGTCACCAAGAGAGGCAGTGGGTACCCCTCACAGGAGGGGTG 
CAAATCAAAGCCAAGGCTTGGAGTGGACCATATTAAATCC [A/T] T 
TTCTTATCCTGTGATTCTTAGAGTCCTATCTGTATCAGGGGAAGGC 
AGGTGGGTTCTAGAACTTTCTAAATGTGTCCCTGTGGGTTTTTCCT 
TCTCCAGCTACACACAAACTTGGGCCTAATAAGAAGTCTATGGCAT 
TAACCCAGCAGGAATGCTTAATGCTTATATCTGACCTCAAACCAAG 
ACTGTCTCCACAGTGAACAACCCCGTCCTGTCCCCTGGGCGTCTCC 
TTAGCAAATGCCATCAGTCAATGGTGCAGCCATCTTGGAGCCCTTG 
CCATCTATAATCTTCTACCGCCACCCCCCCAGCTGATTGTTTTCTT 
TGTATGTCTCCTTCCTGGACATTACTTATTCTTTACTTTTAAATAT 
TTGCTTCCGTAAAAAAACAAATGAATGCCTCGGACAGATTTATAAA 
GAACATTCCTGGAGAGGCGGGTGGATTAATTATTCAGCATCCTCTC 
CCTTTGTAACTATTTATTGTCTCATATGCATTTATATGG 


SEQ ID 
NO. 182 


KCP_1586 
17 


TTGCCCAAGTGATGTTCCATGTCAGGCTCTAGGGTCCCTGCAGGGA 
CAGAGAGGGACTAACATTTACTTACATGCCTATAGTATGTCAGGCA 
TATACTTGTGCCTTTATATATATCAGCTCTGTTTTTGTCATTAAAA 
CATCCCTGTAGAAAGATAGGCACTGCTGTCCCATTTTACAGATGGG 
GAAACCCAAGCTCTGAGTGGTTCAGCAAACCCTGGGTGCATACCCC 
CACCTTGCCCCTGCAAAACCAACAAAAAAACGAAGGCCCTGCCTTC 
CTGGAGCTGACATTTAGGTTGATTCTGAAAGTCAGTAGGCCCAGAT 
TTTCACTCTTCATTTTTCTTGTTTGGAATGAGAGAGCACACAGCTG 


SEQ ID 
NO. 183 



-159- 





GGTCGGGGGAAGGAGCGAGGGTCTAGGCCTGCATCCACTCACCCCA 
AAGGAAAGGAGTAGGGGACCAGTCTGCTGGACATGCAGACAGCGAT 
TGGAGAAAAGTCAGCCCAGCTATGAACCCCATTCCTTTCAGTA [ C / 
T] GAGCCAAGAGGGATGGCATCTGTCAGAGTTGCTGGATTTGGGAT 
TTTGCATCTTGCCAAGTGTCCATGAGGAATTGGGGAAACTCTCCCC 
CTGGCTGGACTGAGGCTTCAGCAAGCATTGTTGCTGCCCAGTGGTG 
ATCAGCTCAGTGTCCTTGGAAAAGAGCAGAAAGTGGTATCACGAAC 
ATATCTTCTCCTTTGCTTCCTTCTCCTCACTCTTCATCATCATCAT 
CATCATCATCATCAAATATGGATCTGTGAGGCTACCTCTGGGGTTG 
AAACTTGGTTTTGGGCAAAATTTGTGATGTTCTCTCTGCCCAATCC 
AGCCTCAGGCTACAAATGAATGTAAAAATCTCTAATTTAGTGCCAA 
GTAACAGAAAACAGCTCTACTTATCTTAAGCCAAAAAGAGGGACTT 
CTCAGAGGCATACTAATGGAGGATGGCAAGAGGGCCTCACGTGGAA 




KCP 1601 
45 


GCTCTTCTGCTGTGGAGGATCCATGCCATTGACCTAGGCACCCGTT 
TTCCACATATTGAGCATTGCTGAGCACCTATTCTGTGCCAGGCACT 
GTGCTTCAGGGCCATGGGGGATGCTCCAAGCGGTAAAATGCAACCA 
AAGCCCCGAAGGAGCTCACATTCTAGTCATGTCCACAAAGAGGTAA 
TAAATCCATAAATTGTATGTACTATTCTAGTCACAATAAAATTGTG 
TCGTACTGTAATGCTGGGTATCCATTTTAAAACGGGGGGCATCGGC 
TGAATCTGGGTCATTACAGTAGGAAATGCATATATATAATCATTTA 
CTCATGAATATTAATGTATTTAATGAGGGTAAAAGATATTACTTAA 
AGCAAAGTATTCGTTCCAGCTACTGTTGGATTTGTTCATTACTGTT 
TCCCATGCAGATATTACCTGTGATTTACCTGCATATCAAGCATCTG 
GAAGTAGCTCAAATCCACCTGTGGGTAAATTAGGTTAGCC [A/G] T 
TTGTTGGCAAAAATTACAGTGTTAACTAATTTCCAGGGTATGCTTG 
CAGTCAGTAGTTTCATACTTAGGTACATGACTTGCATTCACATCAT 
CTGGTTAATGGTGTGAACAGAGATTTTCTTTATGGTTTTTGGAATA 
CAGTAAGATAATGTTAAGCTAACGTAAGTCTGTTAACAGTACCTGG 
TTCTGAACTGTATTTATAAGGTGTATCATAAAACCATTACTTTGGA 
GTTTGCCAATCTTAAATTCAGAACAATTCAAAAATGAGCCAGAATC 
TAGTTTGCATCATTACCACTTATAAAAATAAGGATCTGTAAGTTGG 
CTGGATAAAATATATTACAAAATAATGACTTAAGTGGCTCTGGAGC 
CAGCACAAAAGATAAAAATTGGGTATACTCAAAATTACCTTCAAAA 
TATCTTAAGTCATTCTTAAAATACATGTAAATATGCCAACTCAAAA 
TACATCCAACAAAACTAATATTTTTCCCAATTTGTTGGA 


SEQ ID 
NO. 184 


KCP 164 8 
97 


TCGACGTTTCCAAAGTCATGGGGCCTATGGTTTGTGAGCTTATTTA 
GGTTGTCCCCGGGCCCAGCATCAAAAGCATTGAGACACGTACTGAG 
GGACTCTTTTCCTAGCCTCTCAGTCCTGACTGCTCAAGGACCAAGT 
GGTACTTCTTGCCTGCGTTCCTTTAATGCTTGCCTAATATGAGCTA 

GTCCCAAGAGCCTTTGGAAAGCAGCTGCCTCTCACTAGGACTTCAC 

AGCATCATTTTGCTTTGCTCTCTTTGTGGTTAAAATTACCTTCCAT 

TCGTGGTGGGTGTATGTCAGGATCCCCACAAGAAACAGAGGGACAC 

CCAAATTAGGGACATACTTCAGAGGGACTAATGACAAAGGCATGGG 

TGGGAGTAGAGGGGAATACAAGGGAGACTTCAAGAATCTTGGCCTT 

TATTATAAATGCAATGTATGTCCACTATGGAAAATTTGGG [A/G] A 

AAAAAGCAAACTAGAAAGAAGAAAAACCACATTGCCTGAATTCCTA 

CTGCATGGAGAGAAGCATCATAAACACCTTTTGGAGGAGTCTCTTC 

TTCCTTTCTCCCTTTCTCCTTCTTTGTATAGAGAGGTCTTTCCTGA 

GGACTTCCCAGAATCTTGCAGATCCAAAATCTTAAGAATTTGCAGA 

GGCAGTGAGGAGTTAACATGCACAGCTCAGGGAATATTCTGCTTTT ! 

TATCTGGAACCAGGCTCGGAACAAGACTCCTTGCTTTTCTGTCCTG ] 

TGTTTTCATCTTCTCTCAGAACCCTAACTTTGAGATAAGATCTTTG 

ACTATTATTAGGCGGGTGCAAAAGTAATTGTAGTTTTTGGCATTAT 

TTTTAATAGAACTGCTTCTGTGTCCTCAGATCTCCATCGTTCATCT 

CCTGATAAGTCCCTGAAAATTTCCTGGCCCCTTGGAGCTCCTTCCA 

GGAGTAGAATGATCACAAGAGCTGCCATGTATTGCTTAT 


SEQ ID 
NO. 185 



-160- 



KCP 1692 
34 


TTGCTTATTCCAACTTGGACTTGCCGGAGTCCCATAGACAGAGGCT 
ACTCTCCCACCGTGCTGAAGCTGGTGCATGCCATGTTTCAGTAAGA 
GAAAGGAGGGTGCCTGGGCTTCGTCTCCACCCAGGTGCCTCTCCCC 
CAGCAGCTGCACCAGGCCAGCTGAGGGGGATTTTAGCCCGAATCCA 
GGGTTTCTCCTACAGAAGACAAGGAGTTTGGGCACTGCCAGAATTA 
GAAGAACAGAAAGAAAATGTTCTGGATTTCTCATCAAATGCCCCTA 
GCCTGAGAAATATAACTAAATTCACCCTTAGGTCATCTTACAATCT 
GTCCTGCCCCAGTGTTCCCCACTCAGGGAACTGCTCCACCCACATC 
CTGGTCCCCAAACCAGAGGCCTGGGAGTCACCCTTGACATTTCCTC 
CCCTACACCCCTAATCAATCAAATCCTGTTTATCCTGCCCTCTGAG 
AGTCTGCACCGAAATCTCTCTCCTCTCCTTCCCACCTACC [A/G] T 
GGCCCAGCAGCTTTACCATCATGTCTCACGGATCTCTGCACCGCTC 
CCAACTGGCCTGTGCGTTCACTCCTGCCCCCTCCTCCAGCCCTGTG 
TACACTCCCTTCCACCATCCTTTCTATACTCTCCTCAATCCTATCT 
GCACCCTCCTTCAACCCTGTCTGTACTCTCCTCCAATCCTGTCCAC 
ACACTCCAAACAAAGTCATATTTCCAAGACAAATTTGACCATGCCA 
CTTTCTCCCACAGCTCTCCACCACCTCCAGGATCCCATCCTCAGTG 
TTAGCCAGACACTCCCAAGGCCTTGTGATCTGCCCTGCCTATGTCT 
CCAGCCTCATCCTGCAACTCCCCCTACACTCTGTGTTCTGGCCATC 
AAACCAATGGGCTCCTCTTCCTGCACCCCCATCCACTCTTGCACAT 
CCTGCACTCTATGTCTGAACAGCTCGGTTTCTCTTCTTTTCTTCTG 
GCACATGTCTGCTCTACCTGCAGGTCATCTTAGATGTCA 


SEQ ID 
NO. 186 


KCP_1738 
48 


AAAAGGGAATTTATTGGCTCATGTAACTGAACTTCAACATTTTACA 
GAATCTCATTGGCTCCAATGGGCTCATACGTCCATCCCCAAACCAA 
TCACAGTGACTGAGGGATTATCCAAGGATCACACTGGCCACTTTCA 
CAGGTTTTATCCCTAAAGGAAATCACAGGTAATAGATGTGGGGCTG 
CAGAAATGCAACATGCACTTTTCCTTGAAACTGCATCCCTTTTCCC 
TGAAGATGAAGCTTGAAAGAACTCTAAGAGGTTAAGCATGGAGCTG 
ATGGGCAAGCCACAGGCAGAAAGAGTAGCTGTGCAGCCAGGCTCCT 
GGCCAGGGAGGGCAGATAAGGAGGGGAGGCAAAGTTTGGTAAACAG 
GAAGCTAATCTATGGGCAAGAATCATTTTCTTCAGCATCCTGACCT 
CTCCTAAAATGTTCTCCACTGGTCCCTGCTAGGACAAAGGAATTAC 
CACCAGACTAGAGTCAGGAGTCCTGGGCTGGTTCTGCTGT [A/G] T 
GACACAGGACAGGTGGCTTGCCTGGTCTGGGCCACAGCCTCCTCCC 
CTGTTGATGAGCATGTTGGTTGTTCCAGCACCATGTCAGCCCTAGA 
AATCTCTGAATTCTTGACCAGATCAGTAATTGCTCTCTTGCGTTTA 
CTTTTCCTTCAAATAAAGAGATTGGCATACAGGGGAGGAGCCCAGT 
ACAGACGGCATGCTTGGCTCAGGTTCCAGAACCCAGAAACCAGACA 
AGAGTTGGGAAACCATGATGGTGGAGGAGGGTGTGCCACTCCTTAC 
TAGTGCCTAATCTCTTCGAGACACTAATGTTTCAGTATTATCCACA 
GATTCTGATGC CAGGCAGC CCAGATGACTGGGGTCAGTTATTAGC A 
TGCTTCCTGGAGGTGGTTCCCAGGTGCAGGCTACCTGCAGTCTGGC 
TGGATGGGCCCTGCACCACACTTGCTTCTGGGAAGCTGGTTTTGGG 
GTTGCCACAATCTCTGAAAGAATCACTAGGCCACCCTCT 


SEQ ID 
NO. 187 


KCP 1739 
82 


TTCACAGGTTTTATCCCTAAAGGAAATCACAGGTAATAGATGTGGG 
GCTGCAGAAATGCAACATGCACTTTTCCTTGAAACTGCATCCCTTT 
TCCCTGAAGATGAAGCTTGAAAGAACTCTAAGAGGTTAAGCATGGA 
GCTGATGGGCAAGCCACAGGCAGAAAGAGTAGCTGTGCAGCCAGGC 
TCCTGGCCAGGGAGGGCAGATAAGGAGGGGAGGCAAAGTTTGGTAA 
ACAGGAAGCTAATCTATGGGCAAGAATCATTTTCTTCAGCATCCTG 
ACCTCTCCTAAAATGTTCTCCACTGGTCCCTGCTAGGACAAAGGAA 
TTACCACCAGACTAGAGTCAGGAGTCCTGGGCTGGTTCTGCTGTAT 
GACACAGGACAGGTGGCTTGCCTGGTCTGGGCCACAGCCTCCTCCC 
CTGTTGATGAGCATGTTGGTTGTTCCAGCACCATGTCAGCCCTAGA 
AATCTCTGAATTCTTGACCAGATCAGTAATTGCTCTCTTG [A/C] G 
TTTACTTTTCCTTCAAATAAAGAGATTGGCATACAGGGGAGGAGCC 
CAGTACAGACGGCATGCTTGGCTCAGGTTCCAGAACCCAGAAACCA | 


SEQ ID 
NO. 188 



-161- 





GACAAGAGTTGGGAAACCATGATGGTGGAGGAGGGTGTGCCACTCC 
TTACTAGTGCCTAATCTCTTCGAGACACTAATGTTTCAGTATTATC 
CACAGATTCTGATGCCAGGCAGCCCAGATGACTGGGGTCAGTTATT 
AGCATGCTTCCTGGAGGTGGTTCCCAGGTGCAGGCTACCTGCAGTC 
TGGCTGGATGGGCCCTGCACCACACTTGCTTCTGGGAAGCTGGTTT 
TGGGGTTGCCACAATCTCTGAAAGAATCACTAGGCCACCCTCTGAG 
TGGGTCCTTCTGTAGGAATTATGGATAAAATTGTTCCACTAGTCTT 
ACCTTCTTGGGGAACCCTTCCTGGATTCCCAGGCTGGGCTGGGTGT 
CCCTGCAGCCTAGCCCCACAGCCCTCCTGCTTCTCTTTC 




KCP 1742 
43 


TGACCTCTCCTAAAATGTTCTCCACTGGTCCCTGCTAGGACAAAGG 
AATTACCACCAGACTAGAGTCAGGAGTCCTGGGCTGGTTCTGCTGT 
ATGACACAGGACAGGTGGCTTGCCTGGTCTGGGCCACAGCCTCCTC 
CCCTGTTGATGAGCATGTTGGTTGTTCCAGCACCATGTCAGCCCTA 
GAAATCTCTGAATTCTTGACCAGATCAGTAATTGCTCTCTTGCGTT 
TACTTTTCC TTCAAATAAAGAGATTGGCATACAGGGGAGGAGCC CA 
GTACAGACGGCATGCTTGGCTCAGGTTCCAGAACCCAGAAACCAGA 
CAAGAGTTGGGAAAC CATGATGGTGGAGGAGGGTGTGCCACTC CTT 
ACTAGTGCCTAATCTCTTCGAGACACTAATGTTTCAGTATTATCCA 
CAGATTCTGATGCCAGGCAGCCCAGATGACTGGGGTCAGTTATTAG 
CATGCTTCCTGGAGGTGGTTCCCAGGT [A/G] CAGGCTACCTGCAG 
TCTGGCTGGATGGGCCCTGCACCACACTTGCTTCTGGGAAGCTGGT 
TTTGGGGTTGCCACAATCTCTGAAAGAATCACTAGGCCACCCTCTG 
AGTGGGTCCTTCTGTAGGAATTATGGATAAAATTGTTCCACTAGTC 
TTACCTTCTTGGGGAACCCTTCCTGGATTCCCAGGCTGGGCTGGGT 
GTCCCTGCAGCCTAGCCCCACAGCCCTCCTGCTTCTCTTTCTCATC 
ACAGTCTTGTTATCTCTACCAACTGTAGGCCTGCCCCACTGATGGT 
GTGAATAAAGGGACTGGGTCTCTCTAGCACCTAGCATAGATCTGAT 
ACATAGTGGGTGATCTCTATTGAATGAACGATGAATGAATGAATGA 
ATGAATACATTTAGATAATTCAGATTACTCTTTCTAGCTCAGCAGT 
GTAAAGCAGGAAGACATGCTGTCAATATGATTTAGGGCAAGTTT 


SEQ ID 
NO. 189 


KCP 1751 
06 


AACGATGAATGAATGAATGAATGAATACATTTAGATAATTCAGATT 
ACTCTTTCTAGCTCAGCAGTGTAAAGCAGGAAGACATGCTGTCAAT 
ATGATTTAGGGCAAGTTTTCAAATCTCTCTGGACCTCAGTTTTACC 
TCTTGAAAAATAAATATAATAATTTGTCCTTACTTCATGAGACTAT 
TTTGAAGATTAAATGAGATAATGTATACACTACTACTCACTGTCCT 
TACTTGAATATTCCTAGGTCCTTGGTGCTACATTAGGCTACATAGA 
ATGTATTTAAAGTAATAGAGTGGTATTTAATAAATATTCATTTTCT 
TTCCCCAGAACTACCTTAAATTAATTTGTTGAAAGGACAGATGGAT 
GGATGGTTGATGGAAGTAGCAGGCTTCCAGCAGCAGGGGATGGAGT 
GAGTGTGTGGATACCGCTGGATCAGCAGAAGGTTATACCATTTTAG 
AGTAACTATCTCGGACTTCGGAGAGTTCCTGGGTATGAAG [C/G] T 
TTGGCTTTAATTAAAGTCTCAGCACAGTGTTAAATGCCATTTTATT 
TTAGGTCATAATTAACACTAATGAGATGAGTGGATTACAAAGAGCA 
CACATTTTGAGAAAGTGAAAAACAACATCTGAGCTTGGTGGTTTCC 
ATTTTCGCTTTTCCCCCTCCCATGCTCTGTTCAATTAAAAGTTTTG 
AGAAAATATTACAACCATACTCCTTGTCTTTGTGGTAATGAAGCAT 
ATTAATTTGAATGTGATGAATACAATATTCCACTGACTTTTTTATT 
CCCTTATCTACAAAAGTTTAAAATAATGGACCAATTAAACCAGGAG 
AGAAGAATGCAGGGTTTGCCTGGGGATCCAATTCAGCAACCAGAGA 
ACTGAAAGAACAAAATTTTTTGACGGAGTCTGGGCCAGACTTCATC 
CCTTACCTATAGCTGACAAACAGTAAGTCAAATTGGGCAGATGTGG 
ACCAGCGCAGAACACATACTATATTGAGGATCGAAAGGC 


SEQ ID 
NO. 190 


KCP 1751 
70 


GTGTAAAGCAGGAAGACATGCTGTCAATATGATTTAGGGCAAGTTT 
TCAAATCTCTCTGGACCTCAGTTTTACCTCTTGAAAAATAAATATA 
ATAATTTGTCCTTACTTCATGAGACTATTTTGAAGATTAAATGAGA 
TAATGTATACACTACTACTCACTGTCCTTACTTGAATATTCCTAGG 
TCCTTGGTGCTACATTAGGCTACATAGAATGTATTTAAAGTAATAG 


SEQ ID 
NO. 191 



-162- 





AGTGGTATTTAATAAATATTCATTTTCTTTCCCCAGAACTACCTTA 
AATTAATTTGTTGAAAGGACAGATGGATGGATGGTTGATGGAAGTA 
GCAGGCTTCCAGCAGCAGGGGATGGAGTGAGTGTGTGGATACCGCT 
GGATCAGCAGAAGGTTATACCATTTTAGAGTAACTATCTCGGACTT 
CGGAGAGTTCCTGGGTATGAAGGTTTGGCTTTAATTAAAGTCTCAG 
CACAGTGTTAAATGCCATTTTATTTTAGGTCATAATTAAC [A/G] C 
TAATGAGATGAGTGGATTACAAAGAGCACACATTTTGAGAAAGTGA 
AAAACAACATCTGAGCTTGGTGGTTTCCATTTTCGCTTTTCCCCCT 
CCCATGCTCTGTTCAATTAAAAGTTTTGAGAAAATATTACAACCAT 
ACTCCTTGTCTTTGTGGTAATGAAGCATATTAATTTGAATGTGATG 
AATACAATATTCCACTGACTTTTTTATTCCCTTATCTACAAAAGTT 
TAAAATAATGGACCAATTAAACCAGGAGAGAAGAATGCAGGGTTTG 
CCTGGGGATCCAATTCAGCAACCAGAGAACTGAAAGAACAAAATTT 
TTTGACGGAGTCTGGGCCAGACTTCATCCCTTACCTATAGCTGACA 
AACAGTAAGTCAAATTGGGCAGATGTGGACCAGCGCAGAACACATA 
CTATATTGAGGATCGAAAGGCCAGGTTCCAGACCGTCCTCTAATAT 
TTTCTTAGTGAATATTTGTTGGATGAATGCATGGATGGG 




KCP 1752 
52 


CTTACTTCATGAGACTATTTTGAAGATTAAATGAGATAATGTATAC 
ACTACTACTCACTGTCCTTACTTGAATATTCCTAGGTCCTTGGTGC 
TACATTAGGCTACATAGAATGTATTTAAAGTAATAGAGTGGTATTT 
AATAAATATTCATTTTCTTTCCCCAGAACTACCTTAAATTAATTTG 
TTGAAAGGACAGATGGATGGATGGTTGATGGAAGTAGCAGGCTTCC 
AGCAGCAGGGGATGGAGTGAGTGTGTGGATACCGCTGGATCAGCAG 
AAGGTTATACCATTTTAGAGTAACTATCTCGGACTTCGGAGAGTTC 
CTGGGTATGAAGGTTTGGCTTTAATTAAAGTCTCAGCACAGTGTTA 
AATGCCATTTTATTTTAGGTCATAATTAACACTAATGAGATGAGTG 
GATTACAAAGAGCACACATTTTGAGAAAGTGAAAAACAACATCTGA 
GCTTGGTGGTTTCCATTTTC [A/G] CTTTTCCCCCTCCCATGCTCT 
GTTCAATTAAAAGTTTTGAGAAAATATTACAACCATACTCCTTGTC 
TTTGTGGTAATGAAGCATATTAATTTGAATGTGATGAATACAATAT 
TCCACTGACTTTTTTATTCCCTTATCTACAAAAGTTTAAAATAATG 
GACCAATTAAACCAGGAGAGAAGAATGCAGGGTTTGCCTGGGGATC 
CAATTCAGCAACCAGAGAACTGAAAGAACAAAATTTTTTGACGGAG 
TCTGGGCCAGACTTCATCCCTTACCTATAGCTGACAAACAGTAAGT 
CAAATTGGGCAGATGTGGACCAGCGCAGAACACATACTATATTGAG 
GATCGAAAGGCCAGGTTCCAGACCGTCCTCTAATATTTTCTTAGTG 
AATATTTGTTGGATGAATGCATGGATGGGTGGATGAATAGATGGAT 
GGATGGACAGATGGACGGAGAGAGAGATGGATGAATGGATTGTTGG 


SEQ ID 
NO. 192 


KCP 1768 
36 


GCAGGCCTGTGAACCTGACACATGGTCCAGGTGTCTCCCTGAGGAC 
TTCTGGAAGTCTCCCCACCTCTCTGTGGTCCTTTAGGCATTAACAC 
CACCTTGTCACTGTGTCTTCTGAGGCAGTCTGGAAGTTCATACCCC 
ACAATCTCTGTGTACCTTGTCCCCCATTCTGTTCTCTGCATTGCAG 
ATGGTTTAAAACACACACACATACACGCGCAAAATGTTGTTCCTTT 
TCTTAAAACCCATTGTGGCCAGGCTAGACAAATCCTTAACACGGTC 
TACAATATTCTGCATGGCATGGCCCCTGGGTGCCTCCCAACCTGAT 
CTGTCACACACCACCTCCACCTTTGCCTGTTCCCTGGGCCCTAGCA 
CTAACCTTTGGTTCATTCCTAGACACCTTTTCAGCACTTAGGCCCC 
CACAGCCCTCAGAACCTTTACACTTGCTGTCTCTTTTGCTTTAA [A 
/G] TGTTCTTGCCCCACCTACCACCTAGTTAATGCCTTTTCCTCCT 
TCAGCTCTTAGTTGAAGCATCACTTCCTCAAGGAGGGCAGCCCTGA 
TGAAACTCATTATGCAAACTCCAGCCTGGGTTGGGCCTTATCTTTA 
TGCTGTCATGGCCCTGAGTATTCTTCCTTTATGGCACCAATCACGG 
CTTATATGATATACTTATGCTATTATTTGAGTTATGTCTGTCTCCC 
CCAGTATGCCACTAGTATTAGAATCATTGATTTTTAATCATTGTAT 
CCCTAGTGCTTAGCACAGAGCCTGGCTCATAATAGATGCTTAATAA 
ATATTTGTTGAATAAATGAATGAGTGAATGAATAAATGCCTCATTC 
AAGAGCTTTGGCTCTTTCTGTACTACTACATTACTTCTATTTTTTA 


SEQ ID 
NO. 193 



-163- 





GCTCTTAATTCTCAAAGCACTTTCTTTGTGCTGGGCTTATGCTGGG 
AGCTTAGACAGTAAAGCTTAGA 




KCP 
73 


_1801 


TTACATCCACAGGTTTGATTATAAATGTGTGTATTGAATTGGAATT 
TCTGTTGAAATTCTGATCCCTTCTAGACAAAGAAGGTAAAAATTGA 
AACATGTCAATGGATATCTAAATATCATTACTCACTGGCTTTATTT 
GCAAATGGCTTTCCATTGACAACAGTTACATTTTGTTCAAAGCAAC 
AAATGATTGGCGCTGACAATCCACAGGAACATGGTGCAGTCATTAA 
TGAATGTGCTCATTATTCCTCCCTGCCGGGAGGCATCGACTCCCGT 
TCTCCAGCCTGTTTTAAGCAGACAGACCTACATCTGCACCTGTCAG 
CTTGGAACCCTAGTAGGGGAGGGGGATGCTGATGTGATGGAGAATG 
AAGAATGGGCCCTGCAGGCTGACATTTTGGGAGAGTAGGTTCTGAA 
ATTTATCCCAAAGGACATGGAATCCTGGAAGCAGGGTTCAAGATCC 
TCCCAAAATTGATCTCCCAGGATGCTTGGAATGATTGTTC [C/T] G 
AGGGTTTTGTAAAATGCCAGGGGAAAACCAGGAAGCTTCTCTCCAG 
TTGTCTTGCCTCCTTCCTCTCCAGTCTCCATGGAGCTGACTTTGAG 
AATTAACTCCTGAGGGACAGAGACCCTGGGATGGAGAGCCAGCCCT 
GCTGGATTCCACAAGGTGCTGCTTAAAGCACAACACCTCTTCCCAA 
TGACAGGTTCTGAAAGAAGGCCTTGTAGCTAGATGCACAGAGGGTT 


SEQID 
NO. 194 






GCTCTGGGCTGGGTACAGTGGCTCCCACCTGTAATCCCAACACTTT 
GAGAGCTGAGGCAGGAGGATCGCTTGAGCCCAGGCGTTCTAGACCA 
GCCTGGGCAATATAGTGAGATCTCTATGTCTAGAATGTTTTTTAAT 
TAGCTGGGCTTGCTGCCTGCACCTGTAATTCCAGCTACTTGGGAGG 
CTAAGGTGGGGGGATCACTCGAGCCCAGGGGGCTGAGGC 




KCP 
37 


1802 


CCTTCTAGACAAAGAAGGTAAAAATTGAAACATGTCAATGGATATC 
TAAATATCATTACTCACTGGCTTTATTTGCAAATGGCTTTCCATTG 
ACAACAGTTACATTTTGTTCAAAGCAACAAATGATTGGCGCTGACA 
ATCCACAGGAACATGGTGCAGTCATTAATGAATGTGCTCATTATTC 
CTCCCTGCCGGGAGGCATCGACTCCCGTTCTCCAGCCTGTTTTAAG 
CAGACAGACCTACATCTGCACCTGTCAGCTTGGAACCCTAGTAGGG 
GAGGGGGATGCTGATGTGATGGAGAATGAAGAATGGGCCCTGCAGG 
CTGACATTTTGGGAGAGTAGGTTCTGAAATTTATCCCAAAGGACAT 
GGAATCCTGGAAGCAGGGTTCAAGATCCTCCCAAAATTGATCTCCC 
AGGATGCTTGGAATGATTGTTCCGAGGGTTTTGTAAAATGCCAGGG 
GAAAACCAGGAAGCTTCTCTCCAGTTGTCTTGCCTCCTTC [C/G] T 
CTCCAGTCTCCATGGAGCTGACTTTGAGAATTAACTCCTGAGGGAC 
AGAGACCCTGGGATGGAGAGCCAGCCCTGCTGGATTCCACAAGGTG 
CTGCTTAAAGCACAACACCTCTTCCCAATGACAGGTTCTGAAAGAA 


SEQID 
NO. 195 






TTTAACCTTTCAGCATCTGTCTAAAATTGCTCTGGGCTGGGTACAG 
TGGCTCCCACCTGTAATCCCAACACTTTGAGAGCTGAGGCAGGAGG 
ATCGCTTGAGCCCAGGCGTTCTAGACCAGCCTGGGCAATATAGTGA 
GATCTCTATGTCTAGAATGTTTTTTAATTAGCTGGGCTTGCTGCCT 
GCACCTGTAATTCCAGCTACTTGGGAGGCTAAGGTGGGGGGATCAC 
TCGAGCCCAGGGGGCTGAGGCTGCAGTGAACCATGATTACACCACT 
GAACTCCAGCCTGGGCAACAGAGTGAGACCCTGTCTCAA 




KCP 
80 


1840 


CTGATGGAACTGGGATGTGAGAAGAAGGCAGGTTTTCTGATAAACA 
ATTCCTGTATCTTTCACAAATGCCAAATCACAGACTCAGCTTGGGA 
CATATGAGGACAGCACAGACTTTGGAGGCAGGTAGATTTTGGGTTG 
TCACGCAGACACCCACTACTATGAGACCTGGATTTCCTTCTGACGT 
TATTGGGGATAAGAAGTGGCACCTCACCATTTCTAGGAAATAGTAG 
GTAAGTCTTTCTGGTTGCCACTGAGGTGACTCACCTGAGACACAGT 
TGCTCCTAAAGTTCAAGGTTAGGAGACAATCCAGAAGGGGAGCTGT 
CTGTGAAGTCAGAATTCTTGGAAGAATGTAAGTCTTTACACAGTAA 
CAGCAAAGCAGACAGTGGGAACCACTACTCTGCCTTCTTGCATCAT 
TCTTTCCTAGAAATACCAGAAAGCAGTGAGGGATTAAGTCTAATTC 
CTGGCACCTGACCTTATATCTAACAGATGCTCAGTATTAC [C/G] T 


SEQ ID 
NO. 196 



2345.2049-005 



-164- 





GTTGATGGGACCTCACTGGGAATGTTTTGTGTGCAGTACAAAAGGG 
CAATAGATGAAACTTTGGGACGGGAGCCCAGGAAAATGGCTGAGAG 
GAGAGCTTATGCCTAGCTTATGCATGAGCTTGCAAAAAGGGAGAAT 
ACACGGGAGGGAAGATCAGCAACAGCATGAGTTTTATAAGGCAGAG 
AGTTGTTGGGAAGGAAGCAGCAGGGAGAGGGGAAGGAGTAAGTAGA 
AACCTAGAAGAGATACAGCTAAGATAAGCCAAGAGAACAAAGTATT 
GACTTACCAGAAACATGGAAGTCTTCCTGCTTCTAATTTAGTTCCG 
CATATCTGGATATGTGAATGCCTAAAATCCCATTAAGCCCAGTGGG 
TTAATTATTACACTTGCTAGGGCCCCAGAGGAGAGGAAACACAGTA 
AGTCAGAAAAACCTCTGGGCAGGTGAATTTCTCAGGTTTTCTTCTG 
GGCAGATGGGATCTGGAATGGTAGCGTGGCATCCTGGTA 




KCP 1855 
79 


CCTTTCCAATATTAAAATAATATTAACATTGGTAATAGTGGTACTA 
AACAACTTAGGGTGTTTTTTTTTTCATTTAATAGTATATTTTTAGT 
ATCTTTCCAGGAAAAGATACATGGATGTGCCACATTATTTTTAATG 
GCTCACATGGTACTCCTTTTATGTATGCACTATAATTTATGGAACC 
AGTTTTCTCACCGATGAGCATGTAAGTTCTTTCAGTCTTTTACTGT 
TATAAACGAATGATGCAATGAATATCCTTGTACATATATATTTGTG 
CGCATATGTAGGTATCCTTACAAGTGGAATTTCTGAATAAATGGAT 
ATATACAATTTATTTATGAATTTACCTTCCTACAAGTGATTCAAGA 
GAGTGTCTTTGCTCCACAGTGTTGTCAATATAGTGTATTCTCAAAA 
TCTGACACCAATATGTGTGAAGTGCCTGCTCTGTTCCCACACTTTA 
CACAGGTTCTCTTATTTG [C/A] GTTAAGTTTATTTAAGAAGAGGA 
AACTGGGCCTCATGGAGATCTAGGAACTTGCCCAAGGACAGGTCTC 
TGTGACTCTAAGAGTGCAATCTTCCCTTTTCCCCATGTCAAGCACC 
TTTCCCCACCAGGCTCACTGCTGACAATCCAGTGTACGAAGAAGGG 
AAATTACCCCCACAGAGCCCAAAAGTTTAGGACATGCCGACAGCAT 
CACTCTTTTGCCTCCTCATTCTCTCTTTCATTTCCAGAACATTTGC 
TCACTCAGTGCTGCCCAGTGATACTTAGCCAGCCTGATTACCCATC 
TAATAATTTCTGATACTAATATAAAACCTTCCCAAAGACAAATATA 
ACTGAGACGCACTCCAGCTTACCATAGCTTTCCTGGTGGTACAGTT 
TCCAGGGACATTTCACTGTGTCAAAGCAGGGACCACATATGTTCCA 
GACCAGCTTGTTGGGTTTTTCACTGGGAAGTGAAGACAAATTGTTG 
TCCCTT 


SEQID 
NO. 197 


KCP 1860 
48 


TTCCCACACTTTACACAGGTTCTCTTATTTGCGTTAAGTTTATTTA 
AGAAGAGGAAACTGGGCCTCATGGAGATCTAGGAACTTGCCCAAGG 
ACAGGTCTCTGTGACTCTAAGAGTGCAATCTTCCCTTTTCCCCATG 
TCAAGCACCTTTCCCCACCAGGCTCACTGCTGACAATCCAGTGTAC 
GAAGAAGGGAAATTACCCCCACAGAGCCCAAAAGTTTAGGACATGC 
CGACAGCATCACTCTTTTGCCTCCTCATTCTCTCTTTCATTTCCAG 
AACATTTGCTCACTCAGTGCTGCCCAGTGATACTTAGCCAGCCTGA 
TTACCCATCTAATAATTTCTGATACTAATATAAAACCTTCCCAAAG 
ACAAATATAACTGAGACGCACTCCAGCTTACCATAGCTTTCCTGGT 
GGTACAGTTTCCAGGGACATTTCACTGTGTCAAAGCAGGGACCACA 
TATGTTCCAGACCAGCTTGTTGGGTTTTTCACTGGGAAGT [A/G] A 
AGACAAATTGTTGTCCCTTTGAAAAAGCATCTTTCATCTCTCCATC 
TATCTGCGATCTAAAGCAATGGGGCTCTTTCTGTATGTCTTTCAAA 
TGGTCTACACTGACACACGTTTTCTCTGAGCTGCCGAGAGAATATG 
CCATGAGATGTTGCCAGTGATGGTTACACTCAGCTAGCAGAAGATT 
AGGGACTGGTTAAACCTTTGGAGAAATTGCCTTGGGAAAAGAGGAA 
ATAAAAGCAAATATTACTATGAAACATAGAGATTACCAGGTAGGAG 
GAGGAGAGAGGTGGAGGGAGGGGTAGGAGTGGAAGGAAGGGAGGGA 
GGCAGAAAGAGGAAGGCAGACTGGTGGAAAATAAACCGTGCACTTT 
AGAACAGCAGGAAGGGAGGCTTGGAAGCCTGGTTTTCTGGCTTTGA 
ATGACCGCCTAGCGCTTGCCGGTGCGCCAGGGTGCTGTGAGGATGT 
GGGCAGAGGGCGAGTCCGAAGGGCTCCAGACACTGGGAA 


SEQ ID 
NO. 198 


KCP 1866 
79 


GAGAATATGCCATGAGATGTTGCCAGTGATGGTTACACTCAGCTAG 
CAGAAGATTAGGGACTGGTTAAACCTTTGGAGAAATTGCCTTGGGA 


SEQID 
NO. 199 



-165- 





AAAGAGGAAATAAAAGCAAATATTACTATGAAACATAGAGATTACC 
AGGTAGGAGGAGGAGAGAGGTGGAGGGAGGGGTAGGAGTGGAAGGA 
AGGGAGGGAGGCAGAAAGAGGAAGGCAGACTGGTGGAAAATAAACC 
GTGCACTTTAGAACAGCAGGAAGGGAGGCTTGGAAGCCTGGTTTTC 
TGGCTTTGAATGACCGCCTAGCGCTTGCCGGTGCGCCAGGGTGCTG 
TGAGGATGTGGGCAGAGGGCGAGTCCGAAGGGCTCCAGACACTGGG 
AATAGTGGTGGTCGTGTGCTCCTCCCTGAAACTTTTGCACTACCTC 
GGACTGATTGACTTGTCAGACGGTAAGCGAACCCTGGAGCTTCCCC 
GTTTTCTGTGAATGTGTTTTTGTGGCTTCGGTTGCTGTGA [C/G] A 
GTCGTTTCGAAAATGCACGGAAATGAGGGCGGAGACCCGAGAGATT 
TGAAAAAGCCGGGCTGAAACAGCGTGGTATTGGTCCCCGCCTCCCC 
AGTCGCGCCCCAGTGCTGCGCTGTCCGTCGTGCTGAAATGTGGTGC 
GCCTGGGGAGTGCGGGAGCCAGGAAGTTAGGGTCTCCTGCTCCGGC 
CCTATGAGCATGTGAGTCTTGATGGATTATTAGCTATGGGTGAGGC 
CAGCACAACACATCACAATTCTCTCTGAAGCTGTCTGGTAACTACG 
TATATTGTTGATGGAAGCCAGTGACTTTTAAAAGCCATTATGTTGA 
TTAACTTTTTTAAAGAAGTTTAGGAGATTATATGGAGGTAAAAACC 
TTTGTAAAATGCTAATCACAGTGTCTGACAATTAGAACACATTTAA 
TAAATGTCAGTTTCTTTGCTCAACCCTTATAAGAACCCTTATTCCA 
AAGCCACCTCCTCAGCTCTGACTTCAGCTCCATTCCTTA 




KCP 1871 
16 


TGCTGTGACAGTCGTTTCGAAAATGCACGGAAATGAGGGCGGAGAC 
CCGAGAGATTTGAAAAAGCCGGGCTGAAACAGCGTGGTATTGGTCC 
CCGCCTCCCCAGTCGCGCCCCAGTGCTGCGCTGTCCGTCGTGCTGA 
AATGTGGTGCGCCTGGGGAGTGCGGGAGCCAGGAAGTTAGGGTCTC 
CTGCTCCGGCCCTATGAGCATGTGAGTCTTGATGGATTATTAGCTA 
TGGGTGAGGCCAGCACAACACATCACAATTCTCTCTGAAGCTGTCT 
GGTAACTACGTATATTGTTGATGGAAGCCAGTGACTTTTAAAAGCC 
ATTATGTTGATTAACTTTTTTAAAGAAGTTTAGGAGATTATATGGA 
GGTAAAAACCTTTGTAAAATGCTAATCACAGTGTCTGACAATTAGA 
ACACATTTAATAAATGTCAGTTTCTTTGCTC [A/G] ACCCTTATAA 
GAACCCTTATTCCAAAGCCACCTCCTCAGCTCTGACTTCAGCTCCA 
TTCCTTAGTGAGAATGGGGTTATAAATCCAGGTTAACCCGATTGTT 
TAGGATTAGAAAGTGATTTGGTTTCCAACGTTGAAGGAGTTCAAGA 
AACAAAGAGTTTTATTTTTCCTCCTTATGAGATATTGTTCCAAATA 
GAACACAGTTTGTCTAGATGATTTTTGTCACTTAAAATTAGGCTCC 
AGGAAAGATTCCAAATTTCATGAGCAATTGGGCTCATAAAACAAGA 
TCAAACTCCAATAGTGTATATCCAAAGTATGTATAATGTGTATTCG 
GTGTATATTCTTCCACCACTGCATGGTGTAGACAGAATTTCTCTTC 
CAAGGGGCACCACATGACAAAACCGTACATAATAATGAAATGCATT 
TGTAGACAAAGGACTAGCTAAAATACCAACTGAAAGTGGGAAGACC 
AGAAACTGAAG 


SEQ ID 
NO. 200 


KCP 1872 
58 


AATTGCCTTGGGAAAAGAGGAAATAAAAGCAAATATTACTATGAAA 
CATAGAGATTACCAGGTAGGAGGAGGAGAGAGGTGGAGGGAGGGGT 
AGGAGTGGAAGGAAGGGAGGGAGGCAGAAAGAGGAAGGCAGACTGG 
TGGAAAATAAACCGTGCACTTTAGAACAGCAGGAAGGGAGGCTTGG 
AAGCCTGGTTTTCTGGCTTTGAATGACCGCCTAGCGCTTGCCGGTG 
CGCCAGGGTGCTGTGAGGATGTGGGCAGAGGGCGAGTCCGAAGGGC 
TCCAGACACTGGGAATAGTGGTGGTCGTGTGCTCCTCCCTGAAACT 
TTTGCACTACCTCGGACTGATTGACTTGTCAGACGGTAAGCGAACC 
CTGGAGCTTCCCCGTTTTCTGTGAATGTGTTTTTGTGGCTTCGGTT 
GCTGTGACAGTCGTTTCGAAAATGCACGGAAATGAGGGCGGAGACC 
CGAGAGATTTGAAAAAGCCGGGCTGAAACAGCGTGGTATTGGTCCC 
CGCCTCCCCAGTCGCGCCCCAGTGCTGCGCTGTCCGTCGTGCTGAA 
ATGTGGTGCGCCTGGGGAGTGCGGGAGCCAGGAAGTTAGGGTCTCC 
TGCTCCGGCCCTATGAGCATGTGAGTCTTGATGGATTATTAGCTAT 
GGGTGAGGCCAGCACAACACATCACAATTCTCTCTGAAGCTGTCTG 


SEQ ID 
NO. 201 



2345.2049-005 



-166- 





GTAACTACGTATATTGTTGATGGAAGCCAGTGACTTTTAAAAGCCA 
TTATGTTGATTAACTTTTTTAAAGAAGTTTAGGAGATTATATGGAG 
GTAAAAACCTTTGTAAAATGCTAATCACAGTGTCTGACAATTAGAA 
CACATTTAATAAATGTCAGTTTCTTTGCTCAACCCTTATAAGAACC 
CTTATTCCAAAGCCACCTCCTCAGCTCTGACTTCAGCTCCATTCCT 
TAGTGAGAATGGGGTTATAAATCCAGGTTAACCCGATTGTTTAGGA 
TTAGAAAGTGATTTGGTTTCCAACGTTGAAGGAG [G/T] TCAAGAA 
ACAAAGAGTTTTATTTTTCCTCCTTATGAGATATTGTTCCAAATAG 
AACACAGTTTGTCTAGATGATTTTTGTCACTTAAAATTAGGCTCCA 
GGAAAGATTCCAAATTTCATGAGCAATTGGGCTCATAAAACAAGAT 
CAAACTCCAATAGTGTATATCCAAAGTATGTATAATGTGTATTCGG 
TGTATATTCTTCCACCACTGCATGGTGTAGACAGAATTTCTCTTCC 
AAGGGGCACCACATGACAAAACCGTACATAATAATGAAATGCATTT 
GTAGACAAAGGACTAGCTAAAATACCAACTGAAAGTGGGAAGACCA 
GAAACTGAAGTGTAAGATGAGGTAAGCCCTGGAGTAAGAGTCAAGA 
AATCCACTTTCTATCCATAATCTGTCTCGGTTTAATGTTGGTCAAG 
TCATTTTTTAAAAAATTCTAGGTCTTGGTTTCCTTATGATGACTTT 
AGATCTCTGTTCCTTGGAATTCTAGAGTGATCCAAAGGTTTCTTTG 
AATTCAGTTTTGTGGGTTGAGACGGGCAGCCAGACTGTGAGTCCCT 
CAGCTCTGCTTCAACCAGAACAGCTCCACTTTACTGTTCAGCATGT 
TAGCCCTGTATGTAAGGATGTTTTTTAGCTTTAGCTAAAATTTAGT 
GACTCTATGACCCTAAGGCCCTGCTTCCCTGAGATTTTGAAAGCTG 
AAGCACATTCGGAAAACTTTTTCTTCCTTAAAAATCACCTGAAATC 
TGACAATCTGGAAGACTAGTTCTGTCTGCTCCAGCCCTTGGTCCCT 
TAGATGTGCTTTTCTGAAGATCCAAACTCAACCTGCCAGTCAATAT 
ACCAACTGAGCAGAGCCCCTGTTCTCCACCAGATTTCAAGAGAACA 
TGTTCCATTCCTGTTCAGAGCTTCAGAGCAGCTTCCGCTAAGATTG 
CACATTAATGCAACAGCGTCCTATTTTCTTTGTTTCTTTTTTTTTT 
TTTTTTTTTTTTTTTGATGAGACAGGG 




KCP 1876 
88 


ATTTTTCCTCCTTATGAGATATTGTTCCAAATAGAACACAGTTTGT 
CTAGATGATTTTTGTCACTTAAAATTAGGCTCCAGGAAAGATTCCA 
AATTTCATGAGCAATTGGGCTCATAAAACAAGATCAAACTCCAATA 
GTGTATATCCAAAGTATGTATAATGTGTATTCGGTGTATATTCTTC 
CACCACTGCATGGTGTAGACAGAATTTCTCTTCCAAGGGGCACCAC 
ATGACAAAACCGTACATAATAATGAAATGCATTTGTAGACAAAGGA 
CTAGCTAAAATACCAACTGAAAGTGGGAAGACCAGAAACTGAAGTG 
TAAGATGAGGTAAGCCCTGGAGTAAGAGTCAAGAAATCCACTTTCT 
ATCCATAATCTGTCTCGGTTTAATGTTGGTCAAGTCATTTTT [ T/ A 
] AAAAAATTCTAGGTCTTGGTTTCCTTATGATGACTTTAGATCTCT 
GTTCCTTGGAATTCTAGAGTGATCCAAAGGTTTCTTTGAATTCAGT 
TTTGTGGGTTGAGACGGGCAGCCAGACTGTGAGTCCCTCAGCTCTG 
CTTCAACCAGAACAGCTCCACTTTACTGTTCAGCATGTTAGCCCTG 
TATGTAAGGATGTTTTTTAGCTTTAGCTAAAATTTAGTGACTCTAT 
GACCCTAAGGCCCTGCTTCCCTGAGATTTTGAAAGCTGAAGCACAT 
TCGGAAAACTTTTTCTTCCTTAAAAATCACCTGAAATCTGACAATC 
TGGAAGACTAGTTCTGTCTGCTCCAGCCCTTGGTCCCTTAGATGTG 
CTTTTCTGAAGATCCAAACTCAACCTGCCAGTCAATATACCAACTG 
AGCAGAGCCCCTGTTCTCCACCAGATTTCAAGAGAACATGTTCCAT 
TCCTGTTCAGAGCTTCAGAGCAGC 


SEQ ID 
NO. 202 


KCP_1893 
31 


CTCTAAAATTTCACCCTCTGTTCTGTACACCAAGTACCTCAGCAAG 
TAATCCAGTTCCAGATGGGATCTGCAGTCTGCCATTAAGTCTTTAC 
CACACATAGGCTCTTATGCTAGAGCCCTTACCATATGGTCCAAAAT 
GCCATTTTTAATGTGTATTTGATATGGAGACTCTGTTCACAATTTG 
AGTACTAAAGAGAGAATACCACCTCCTAGTAGATACACCAGGACCA 
ATGTAATGCTGTCATTCTAAGGAGAGCAGTGGAACATCTCCAAAGA 
ACCCATCTGTAGTCTTCCTTCGGCCCTTGATCTTATTCCTATTTTA 
TTTTTAAGGTTTTTTTTTTTTTCTTCGAGACTAAATCTCACTCTAT 


SEQ ID 
NO. 203 



-167- 





CACCCAAGCTGGAGTGCAGTGGCATGATATCAGTTCATTGCAACCT 
CTGCCTCCCGGACTCAAGCGATTCTCCTCACTCAGCATCCCAAGTA 
TCTGGGACTACAGGCATACACCACTATGCCCAGCTAGTGT [A/G] T 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTAGTAG 
AGACAGGGTTTCACCATGTTGCCCAGGGTGGTCTTGAACTCCAGAG 
CTCAGGCGATCCACCTGCCGAGGCCTCCCAAAGTGCTGGGATTACA 
GGCATGAGCCACAGCGCCTGGCCAATCTTTTAGGGATAATTTTAGA 
ACAGTATACAGATATTGAGCCAAGAGTCAAAAGAGCTGGGTTGCAA 
TTCTGGTTGTGCCATTTATCAGTTGTGTGAGGTGGGACAAGTCTCT 
TTTTCTCCCTAGCTTTCTCTTTCCTCATTTATAAAATAAAGAAATG 
AGAATGATAGTTGTATTAATTTCTGAGGACTGCCAGAACAAATTAC 
TACAAACTGGGTGGCTTAAAACAACAAACATTTATTCTCACATAGT 
TCAGGAGGCTAGCAGTTTGAAATCAAGTTCTTGACAAACTCCCCTA 
GAGTCTAAAGTCTCTAGAGAAGGATTCCTCCTTGCCTCT 




KCP_1927 
42 


GAGCAAGCACTGCAGCCATCCTCCTTTATTTCCCTCAAGGCAATAT 
CCAAGGATTAAAAAGTCAGAGCCGTCTGCAGATTCCTCCTCTCTAC 
CTTGCCCTGCACTTTTTGTGCCCTTCCTCTTCCCCCTCTCCAGCCC 
CAAACCTCTCTCCTGATCCACGGTACTCCTCCTGGGATGTCCACTG 
GGGCTGATCCTCCCCCATTCTCCCCCTGAGTTCCCTGCTGTTAATC 
TGTCTCCAGCAAAATTAACCTAGCCTATGTCCCATGCCCTCTGGAC 
TCTGGCTGCTCGTCAATCACTCTTAAAAATCCGGTTTCTCCTTAGG 
CAATCATTTTGTTTTGATTTTATGTGTAAAAAAACCTGAGTAAATT 
TTTTTTTTTTTTGAGATGGAGTCTTGCTCTGTTGCTCAGGCTAGAG 
TACAGTGGCATGATTTCTGCTCACTGCAACCTCCGCCTCCCGGGTT 
CAAGCGATTCTCCTGCCTCAGCCTC [T/C] TGAGTAGCTGGGACTA 
CAGGTGCCCACCACCATGCCTGGCTAATTTTTGTATTTTTGGTAGA 
GACAGGGTTTCATCATACTGGCCAGGCTGGTCTCAAACTCCTGACC 
TTGTGATCCACGCACTTCGGCCTCCCAAAGTAATCACTGCTGGGAT 
TACAGAAGTGAGCCACCGTGCCTGGCCAAACCTAAGTAAATGTTTT 
AAAATTATACTACTAACATAGCATACAGGCTTTAGACTGTTGGTTG 
CTTTTAAGTTTGCTTACTTTAAAAGCTAGAGAGAAGATGGTTGAGG 
TGATCTTGTCTCCTTCAGTATTCACTCTGAGCCATGCCTCCTGAGG 
AAGTTTGCTTTAGGGGAGGCATTGCTATGTTATACACTCTACGATG 
CACCAGCCCTTGCCTCAGAAGGCAAGGTTTGAACCCCAACACTGTC 
TTTTGCAAACTGTTACCTTAGGAAATAGATTTTATCTCCTTAACTC 
ACTTTTTA 


SEQ ID 
NO. 204 


KCP 1931 
93 


GTTCAAGCGATTCTCCTGCCTCAGCCTCTTGAGTAGCTGGGACTAC 
AGGTGCCCACCACCATGCCTGGCTAATTTTTGTATTTTTGGTAGAG 
ACAGGGTTTCATCATACTGGCCAGGCTGGTCTCAAACTCCTGACCT 
TGTGATCCACGCACTTCGGCCTCCCAAAGTAATCACTGCTGGGATT 
ACAGAAGTGAGCCACCGTGCCTGGCCAAACCTAAGTAAATGTTTTA 
AAATTATACTACTAACATAGCATACAGGCTTTAGACTGTTGGTTGC 
TTTTAAGTTTGCTTACTTTAAAAGCTAGAGAGAAGATGGTTGAGGT 
GATCTTGTCTCCTTCAGTATTCACTCTGAGCCATGCCTCCTGAGGA 
AGTTTGCTTTAGGGGAGGCATTGCTATGTTATACACTCTACGATGC 
ACCAGCCCTTGCCTCAGAAGGCAAGGTTTGAACCCCAACACTGTCT 
TTTGCAAACTGTTACCTTA [G/A] GAAATAGATTTTATCTCCTTAA 
CTCACTTTTTACATTTGCAAAATGGGTAAATTGTGACTACCTCACA 
TGGATGTCATGAGATGAAATGTAAGAATGTGTGTCCCTGGCATATA 
GTAACCACTTTCGCCAAAGACTGAGTTATCCAACTACAGACAGAGA 
ACAGCTGGTGGCCTAATCAAAGGGAGATACAAAATAACAATGCCAA 
GACTGGAAAAGGAAGTTCATCTTAGGATTTCCAAGAGAAAAAGAAA 
TATGACTGTATTATAATAGGTATATTTATTAAGCTCTTACCATGTG 
CCAAGCAAAGTTCTTTATATACATGATATACTTCATATACATTATT 
TCATTTAGTCCTCATGGCTACCAGGTGAGCACCATTATTTTCCCAT 
TTTACAGATGAGGCACAGAGAAGTTAAGCCACTTACCTAGGAAGGG 
CAGTCCTAGTTAAGAAGCTGGGATTCAAATCCAAGAGGCTGGATTC 


SEQ ID 
NO. 205 



2345.2049-005 



-168- 





CAGACCTCAGG 




KCP_1939 
56 


TTATAATAGGTATATTTATTAAGCTCTTACCATGTGCCAAGCAAAG 
TTCTTTATATACATGATATACTTCATATACATTATTTCATTTAGTC 
CTCATGGCTACCAGGTGAGCACCATTATTTTCCCATTTTACAGATG 
AGGCACAGAGAAGTTAAGCCACTTACCTAGGAAGGGCAGTCCTAGT 
TAAGAAGCTGGGATTCAAATCCAAGAGGCTGGATTCCAGACCTCAG 
GCTCTATTATGAGAAGTACCTAAATAGAGATTGGTTTAACCAAAGC 
CTGAGTCCCAACTAAGGGCAAGACTGTGACACAGAGGTCACTAATC 
AGAATGAAAGATTGAGCCAGAGTTGAGTTGTTGGAATGTATTTTGG 
TACATTTAGGTTGTTTTAAGTATATCAATCTCCATTCCACTCAATG 
GTTGAGTTCAGTTTCAAGTTTTCCAAATGCTTTATGGGAAAGTCAT 
ATTTTTCTCCCATTGCAGCAGGGATGCCAGCGCAGCCATG [C/T] T 
TCTCAACCACCAAGTAGAAGCAAAGCCAAACTGACCCAAGAAGATG 
AACAGAGGGAATCCAGGGAGTTCCAACTTGGGTTCACAGCTGCAAT 
TCTCAAAGGATGGACTAAGCCATGTCACCCCTCCAGATAACACAGT 
CATATTAATAGTGACCTTTTGGAGGCCTCCCTAAACAGCAGGTGAA 
GTCCCAAAATCATTAGATTATTCCTGGCCTCAATTGTGGCCCAGAG 
GGAGAGCCCTAAGATTTTTCCATGGGAACAAAGATCTAAATTCTGG 
GACTATCTGGGCCATGTCCACCCTGCACCATTTACTACAAAATGGG 
CTGATCCTATGGAAGCACACTACCTGTGTTGTGGTCATATAGATCA 
TCACCTGGCTTCTCCAGGGCTAACCAGTTAGCATGGAAATGGGACA 
CCCAAGAACAAGAGGATAGAAAGAAGGGAAGGGTGGAAAGAAGGAA 
GGAAGAAAGGGTGGGAGGGAGGGAAGAGTGGTAGTTTTG 


SEQ ID 
NO. 206 


KCP 194 6 
16 


TCCTGGCCTCAATTGTGGCCCAGAGGGAGAGCCCTAAGATTTTTCC 
ATGGGAACAAAGATCTAAATTCTGGGACTATCTGGGCCATGTCCAC 
CCTGCACCATTTACTACAAAATGGGCTGATCCTATGGAAGCACACT 
ACCTGTGTTGTGGTCATATAGATCATCACCTGGCTTCTCCAGGGCT 
AACCAGTTAGCATGGAAATGGGACACCCAAGAACAAGAGGATAGAA 
AGAAGGGAAGGGTGGAAAGAAGGAAGGAAGAAAGGGTGGGAGGGAG 
GGAAGAGTGGTAGTTTTGGAAGGAAGGAGGGAATCAGAGCTAAAGA 
TAATACATGATATGAGTCAGTGTTCAATGTCCCTGAAGATTAGGGG 
AATCAAGCTTTGCTTCCAGGAGAATTAACACAGGAGAGCCAACAGA 
GATGTGGAAATTTAGGAAGTCAGAGGAGACATTCTTTCA [T/C] TC 
ATTCATTCGTTCATTCATTCACTTGCTCATTTTTACATGAATTGAC 
TCTAGAACAGATGCTGGAGATACAAAGATGCATGAGACTTGCCCCC 
ATCCTCAACAGTCATTCACAGTCTAATCAGAAAGAGAGCCTTGCAT 
TTGGAATACAATATGGAGTAATAATACCTCTGTGTTCAGCCTGCAC 
AAAATACTCTGTATGCATGGTCATATGTCCCTTGAAACAACTTTAT 
GAGGAAGATACTACTATAGTCTCCATTTGACAGATAAGGAAACTGA 
GGCTTAGGGAGGTCAAATAACTTGCCCAAGTAAAACAACTAGTAAG 
TAGCTGAACCAC7^AAACAGAGATTCATGCAGAAAGCTGTACAACAG 
AAGAAACCAGGACTACATCTGCCTCAAAGGAACCAGAGAAGGCTTC 
CAAAGAAGGCAGCATTTTAAATGGGTTTTGAAGGATGTATAGCA 


SEQ ID 
NO. 207 


KCP_1965 
48 


CCCAGCCCTCAGGCACATCAGTGCCCTCTCTAGGCTCTCTCTCACC 
AACTTTAGAATTGAATTACATCAGTTGTTTCCAGATGGTGATCTGC 
AGAATTCCTTTAAAGACCACCTGTGGGATTTGAGGGAGGAAAACTA 
CACTCTCCCAATCTCCCTCTTTAACCCAAGCATCTGATTGCTTTCA 
TCTGTTTTACATACTTAGCTTCTGTGCACAACTTCCTTTGATTAAA 
GAGTTCCTTGCCTTTATAGTAGTGGATGATATCTAAGGATGATGTA 
AAATACTGGGTGTTAGCTAAGGTTTTACCAAACTTAAAGCCTTTAT 
GCTTCATAATTCCACTTTATTGATGTAGGAAGACAAATGATAGACT 
TACTTTCAAGGTGGATAGAAGGGATGCGACCTAGCCAAGGCTACAG 
CATTTCTCT [A/G] TGGCACCACTGCCATGACAACCATCAGTTTGA 
ATGCCTTATGGGTGCATCCTATGGGTTATGCACTGGCCCCAAGCCA 
TAACCCCTAGGACTCTAGAGCCAGCAGCAAACACAAAACACTGAAT 
TAATAATGAGTGAGATCTCTGTTCCCATAGCTGCCACAGGCTAAAT 
AAGTTGAGGGGGTATTGTAAAACCCAAGATGAGATCACTGAGCCTC 


SEQ ID 
NO. 208 



2345.2049-005 



-169- 





TGGTATCAAAAAGGTGTATTTCACAGAATGTTTAGTTGGACGAGAG 
CTTGAAGAGCATGGAAACGATCTGGTATCATTCTGGTCAAAGACCA 
GAATTTAGACCCCAGTTCTGCCATTTGCTGACTAATGACTTTGGGC 
AAAATACTTAACTTTCCTGAGAGTTAGTTTCCTCATCTATAAAGTG 
GGGTAATATAACCCACCTTGCAGGATACTGGTAGGATTA 




KCP 1976 
78 


AGCAGGGTCCCCTGAGCGTTCCTCCTCAGCTCCCAACACTTCCCTC 
CAGGCACCAGTATGCAGGGCAAGGTCCTGGAGGGGGCGCCGAAACA 
CCACTCGAGATCCTCACTCTCAGGAATTCAATATAGAAAACACATT 
AAGACCTGTTTACATGGAACTGCTGTTTATAATTATTGTTCCCTAT 
GGGATATTCCCCACTGCTTCCTCCAATCCTCTTTTAAACTGCTCAA 
CTAATAGAGTTTTCCTGGCTTCCCCAGGGAGACATTCACAGATGCT 
AATAGAGACATAATTCAAAAATTGCTTGATATACATGCCCTCAATT 
TTCCCCAAGAACCACCTAAGTAAAGAGCCCCAGACATGCAACACAT 
TCATTGGCCAGATGCAATTTAACATGCGTGGGATTAAATATACAGG 
CTACTACAGCCAGGTTGTCATCAAGCAGCAGCAGGCATGGCATTTT 
ATCCTAAGGTACCACCA [T/C] GGCCAAATGCAACAGGAAAGAAGC 
AGGCTGCTGGGTGGGACCCCTGGAAGATCCCCTCCTCTGTAATTTC 
CACTGCAAGCTTTTCCCAGGCCTTTTCAGGCAAAGCGGGGAGTTTT 
GAAAATAAATCCCCCAGGCTTGGAGAAGCAAAGAATCAATGCTAAG 
CAGCTCCGGAAATAATAGCTTCCATCTCTCTGATATATAAAGAGGA 
TAAGGAAGGCAGAAAGAAGGGGCATGATATTATGAGATTGCAACAA 
TACATTGCAACATTACATTAAAGAATTACAGAAAGCAAGATCTAGC 
TTCAGATGCCAGTTCATGCACTTACTCCCTGTGTGACCCTGGGAAT 
CACTTAAGCTGTCTGAGACTTAGCTTGTCTAATGACAAACTGGGGA 
TACTAATAT CAC CTCCCAGGATTGTTGGGAAGGTAAATGGAGATTG 
ACAAATGTGAACACACTTAGTATGTCTTT 


SEQID 
NO. 209 


KCP 1977 
75 


TGTTTACATGGAACTGCTGTTTATAATTATTGTTCCCTATGGGATA 
TTCCCCACTGCTTCCTCCAATCCTCTTTTAAACTGCTCAACTAATA 
GAGTTTTCCTGGCTTCCCCAGGGAGACATTCACAGATGCTAATAGA 
GACATAATTCAAAAATTGCTTGATATACATGCCCTCAATTTTCCCC 
AAGAACCACCTAAGTAAAGAGCCCCAGACATGCAACACATTCATTG 
GCCAGATGCAATTTAACATGCGTGGGATTAAATATACAGGCTACTA 
CAGCCAGGTTGTCATCAAGCAGCAGCAGGCATGGCATTTTATCCTA 
AGGTACCACCACGGCCAAATGCAACAGGAAAGAAGCAGGCTGCTGG 
GTGGGACCCCTGGAAGATCCCCTCCTCTGTAATTTCCACTGCAAGC 
TTTTCCCAGGCCTTTT [C/T] AGGCAAAGCGGGGAGTTTTGAAAAT 
AAATCCCCCAGGCTTGGAGAAGCAAAGAATCAATGCTAAGCAGCTC 
CGGAAATAATAGCTTCCATCTCTCTGATATATAAAGAGGATAAGGA 
AGGCAGAAAGAAGGGGCATGATATTATGAGATTGCAACAATACATT 
GCAACATTACATTAAAGAATTACAGAAAGCAAGATCTAGCTTCAGA 
TGCCAGTTCATGCACTTACTCCCTGTGTGACCCTGGGAATCACTTA 
AGCTGTCTGAGACTTAGCTTGTCTAATGACAAACTGGGGATACTAA 
TATCACCTCCCAGGATTGTTGGGAAGGTAAATGGAGATTGACAAAT 
GTGAACACACTTAGTATGTCTTTACATAGTAGGTATTCAATAAACT 
CTTCTATATATCTTCTCTTTCTGAAAATCTGAATATGGGGAGCATG 
GATATG 


SEQ ID 
NO. 210 


KCP 1989 
33 


GTCCCCACCACTCCTTTTTATTGCAGAGGGAATTGACATTCAGGGA 
ATGGAAATGCCCAGCCCAGAATTGGGGATGTGGTCTGGGAACCCAG 
GTCTCCCATCCCACTCCCTCGCCCTCTCACCCCCTCCCGCTGGTCA 
GTGTTCTTTGTCCTCTGCTGGCATCCCTGGGGACGGGCCAGCCCCC 
ATCCCCCCGACACACACACATTGTCCCTTCAAGATGGAGCCAGGCT 
GACACCACGTAGAATGACCTGGAAGCCCCCACTCAGTCTACCAGTC 
CTCCCTCCTCACACAGGAATAGATGGGAGGGAAATGAAATAAGCTG 
CCATCTGCTGTGCATCCTCTGTGTGCCATGCTCTGGGTACCCATCT 
AATCCTCGTGAAGACCCTGAGAAGTGAGTGTTCTTCACAGACTAGG 
CAACACCAGAAGGCAG [G/A] TGAAGAACGTACAGAAGCTACAGAG 
TGCACAGGTGACAGGTATGAGAGCCAAGCCATTCAAACTCCCTGGG 


SEQID 
NO. 211 



2345.2049-005 



-170- 





TATAGGACCCAGCTCTTCCCACGTCTCTGCCTTTACCGAATCAAAC 
ACCTGAGCACGGAAGACCCTCCATCAACATGAACTGCTTTGAATTG 
ACATGAACAAGCTTCAATCAAACTATAAATGCTGAAATTTTTCAAT 
TATAGAAAGTATTTGAAAGATCCCATAAATTCCCCTGTCATATCAC 
GTGAGCTGCATTTACTGCAGCAGACACTTTTTATCTCGGGCTTGGA 
GGAAGGATTAGCAAGAAGAAAGTGGAGGGGGTCTGAGGAAGGGCTG 
GCAGCCTAGAGGAGGACAGCAGCAAGAAGCAGGCTGGAGGCAGTTC 
TGTGCTGCCGGCCTTCATGGGTGTGGCCTTTGGACAGCACCTTAGC 
AGGAATGTGGTGGAGAGCAGCCCCATTCACTCCAGAGGAGAGC 




KCP 1993 
65 


AAGACCCTGAGAAGTGAGTGTTCTTCACAGACTAGGCAACACCAGA 
AGGCAGGTGAAGAACGTACAGAAGCTACAGAGTGCACAGGTGACAG 
GTATGAGAGCCAAGCCATTCAAACTCCCTGGGTATAGGACCCAGCT 
CTTCCCACGTCTCTGCCTTTACCGAATCAAACACCTGAGCACGGAA 
GACCCTCCATCAACATGAACTGCTTTGAATTGACATGAACAAGCTT 
CAATCAAACTATAAATGCTGAAATTTTTCAATTATAGAAAGTATTT 
GAAAGATCCCATAAATTCCCCTGTCATATCACGTGAGCTGCATTTA 
CTGCAGCAGACACTTTTTATCTCGGGCTTGGAGGAAGGATTAGCAA 
GAAGAAAGTGGAGGGGGTCTGAGGAAGGGCTGGCAGCCTAGAGGAG 
GACAGCAGCAAGAAGCAGGCTGGAGGCAGTTCTGTGCTGCCGGCCT 
TC ATGGGTGTGGC CTTTGGAC AGC [A/G] CCTTAGCAGGAATGTGG 
TGGAGAGCAGCCCCATTCACTCCAGAGGAGAGCCTCAAACTCTTCA 
GGCAGATCTAGCCTAGGTAGAATCTTGGCCTGGCCCCTCCGGGATG 
ACAGGTGCCATTGCCCAAGAATGGGGAAAAGGCTGAAGTGCTCCAG 
CCAAAGACCCCAATTTATCTTCAGGACAATTTTCACTGGAAACCTT 
GCCTCACCACTGCCCACTTTTTCAGAAGTAATTAGAATGCTAATCT 
ATAAGAAAGATGACTATTAAAAATAAATTAATAATAGATAATACAT 
TTTGGCTTACAATTTTGAATAATATAGCCATCCCATCTTAAAGTAA 
AAATTCATATATTTTTAATAAGCCTGAGACATGTTTTCCAATGAAC 
CACAGATGGTTCATTTTTATTATCCTATAAAGAGACATTATGGGCA 
AGTGTTTTTTAAAATGGTAAAACAGAACCTTAGAGCAGCTCTCTTT 
T 


SEQ ID 
NO. 212 


KCP 2002 
41 


GGCAAGTGTTTTTTAAAATGGTAAAACAGAACCTTAGAGCAGCTCT 
CTTTTGAAGATCTCTAAGCACTTTCTAAGCATCAGGACCCCCTTCT 
GTCATCACAGAGACTGAAATGAGGAGATGGTCTCTGTCACCCCCTC 
ACTCACCAGTGAGCCCCAGACCTTCATCCCTGATCAGATGGAAGCA 
GTGTGGCATGATTACAGTTCATATTTCAACTCTGCCACTCAATGAC 
TAATAGCCAAGCACTAATAATGCAGAAAATGTAAATTTAAAAAATA 
ATCTTCCTGAGATTGGTTATGAAATGCACTCAACACAGCACCATCC 
ACAGAGAGGTTCTTTTTAATTGCTCTTTTCTTTCCTCTCGACACCC 
AGAATCACAAAGCATGCCTGAAAGCGTCACACATATATGTCTGTGA 
CCATAACATGGCATTGCACATGCAAAGGAAATAA [A/G] TAGGTGT 
TACCCATGTGACAAAGGTCCATGAGCTCTGTCCGCAAAAAGCTGTT 
GAGTTTAAAGAACAAATAATTCTGAAAAATCTTCCAGGAGATGAAA 
TTTGTAGAACTCAAGGGCAGTAAACTAGCTGCTTTCCAAGGACTTG 
TCATAGCTTTATTGACTTACAATAGCCAAAGATAAGTCAGTATTAA 
TCAAACCCATTCTCTAGAAAAACCTCATCATCACTGGGGCCAGGGC 
AGAGAAGTGTGACACAGCTCTCTCCAGCTTCCCCACTTCACAGCAT 
GGTTCCACCATCCACCCAATTGCTAAAGCCTGGATAGTCTTCCTTG 
TCACCTCCCGATCCCCTTCTCTAACACCCATCCCCCGGCCACCCAA 
CATCAGCAAGTCTGGTGGTTTCTCTCTGTCACAGAGATTCAAGATC 
TTCCC 


SEQ ID 
NO. 213 


KCP 2019 
85 


TCGTAGTGTTCATAGACTCTCCTTCCTTCCTAACTTAAAAGAGGCT 
CCTTCTGGTTTTTCCTTCATACACTTCCCTCACTCTTTTCCTTCAC 
TGCACTAAAGATGATTTCTAATTGCATAGTCATTGATGCCAGTATT 
TGTTTATTGTGTCATTCCTGCTGAACAGAGGATGGGCCTGACTTAT 
TTGGGACCATGTTGCTGATGCCTGGACCTAAGCCTGGCACAGAGTA 
GGAGCTCAACAAATTTGTTAAATGAGTGGCTGAATGGCCATACTCT 


SEQ ID 
NO. 214 



-171- 





CAAAGGACCCACAGTCTAGGAGAGACAGAAGAATCTTTGTCTTTTT 
GTCTTGCAGTGGGATGGAAGCTGCAGGGAGGGGTCTTGTCACATTG 
ATACTGTCTGGGGAAGACAGAAAAACTTCAGTTTCAGAGGAGGTAG 
CCCTTGAAAC [G/A] AGATTTGAGAGAGGGCAGCACATTGTACAAC 
TCCATGGGCACCATGCACATTGTAGTCCAGATAAACAGAGCCCCTT 
GGAGATATGTGAGGCATGGGATAGACTCAGAGAAACCCAGGAAATA 
ACCCCTTCAGGCATCTGACATGCAAAGATGTGGAAGTGTCAACCAG 
GAAGTCATGTTGGGGGAACAGCAAGTATTTACAGAAAGTGACTGTG 
TGTGTCTGTGTAGGAGGGTGACTTTGTATAGGAGAGATAAAACCTG 
TGAGCTAATCAAGGAGAAGATCATAAAAGACCTTCATAAAGAGCAT 
GGCCTTTTTCCTGCAAGCAGTGAGGAGCCATTGAAGGCTTTAGCAT 
AAGGACAGTCAGATGTACTTCCCTAGAATGCACATTTCCTTCTGCT 
CCAGAACTTCTGCACAGGAGGCTCCTAAAAGCTCTCCCCATCCTCC 
CTGTACACGTAGAATCTGCCTCTGTCTCTCTTTCTCTCT 




KCP 2020 
67 


CACTTCCCTCACTCTTTTCCTTCACTGCACTAAAGATGATTTCTAA 
TTGCATAGTCATTGATGCCAGTATTTGTTTATTGTGTCATTCCTGC 
TGAACAGAGGATGGGCCTGACTTATTTGGGACCATGTTGCTGATGC 
CTGGACCTAAGCCTGGCACAGAGTAGGAGCTCAACAAATTTGTTAA 
ATGAGTGGC TGAATGGCCATACTCTCAAAGGAC CCACAGTCTAGGA 
GAGACAGAAGAATCTTTGTCTTTTTGTCTTGCAGTGGGATGGAAGC 
TGCAGGGAGGGGTCTTGTCACATTGATACTGTCTGGGGAAGACAGA 
AAAACTTCAGTTTCAGAGGAGGTAGCCCTTGAAACGAGATTTGAGA 
GAGGGCAGCACATTGTACAACTCCATGGGCACCATGCACATTGTAG 
TCCAGATAAACAGAGCCCCTTGGAG [A/G] TATGTGAGGCATGGGA 
TAGACTCAGAGAAACCCAGGAAATAACCCCTTCAGGCATCTGACAT 
GCAAAGATGTGGAAGTGTCAACCAGGAAGTCATGTTGGGGGAACAG 
CAAGTATTTACAGAAAGTGACTGTGTGTGTCTGTGTAGGAGGGTGA 
CTTTGTATAGGAGAGATAAAACCTGTGAGCTAATCAAGGAGAAGAT 
CATAAAAGACCTTCATAAAGAGCATGGCCTTTTTCCTGCAAGCAGT 
GAGGAGCCATTGAAGGCTTTAGCATAAGGACAGTCAGATGTACTTC 
CCTAGAATGCACATTTCCTTCTGCTCCAGAACTTCTGCACAGGAGG 
CTCCTAAAAGCTCTCCCCATCCTCCCTGTACACGTAGAATCTGCCT 
CTGTCTCTCTTTCTCTCTCCTCCTCCTCCTCCATCTCCTCCTCCTC 
CTCCTC 


SEQ ID 

NO. 215 


KCP 2 02 

795 


GCTCCAGAACTTCTGCACAGGAGGCTCCTAAAAGCTCTCCCCATCC 
TCCCTGTACACGTAGAATCTGCCTCTGTCTCTCTTTCTCTCTCCTC 
CTCCTCCTCCATCTCCTCCTCCTCCTCCTCTCCCTCTCTCTCGCTG 
TCTCACACACACATACACACACACTCCTTCCTTCCTATCTAGTCAG 
ATTCCACTCCTTGGGATTTCAGGCCCACCGTCACTCCTCAGGGAAG 
CCTGCCCTGAATGCCTGCACTACACCAGGGCCCCTTTCCCCTGCCC 
CCATCCCAGAGCACCAAATAGCTTTCCCTTGCAGCACTTCTCACAG 
CTGTCATTTTATGTTTGTGTCTGTGATTCTTAGGTTAAGTCCCTCA 
TGCACCAAATCATAAGATCTGGGAACAAGGACCACACCTGTCCTG [ 
C/T] TCATCACTGTAATCATCACACTGCCTGCCAAAGTGCCTTGCA 
CATATTAGATACTTAGTAGTTATGTGTTCCATGAATGACTCTTTAA 
GAGATCTTCTAGCTGTTCTTGCAAAGAACCCATTGGTAAGGTTGAA 
CCTACAGGCTGATACTTTGCACTAGTCTCAGGAAGAGATGGTGAGT 
ACATGAAATTGAGTCCCCCAGAGGTTAATGCCCAGTGCCCCAGCTA 
GGAAACGTCCAAGGAGGCAATTTGAACCCCATCTGTCTGGCTGCAG 
AGCCTAGCCCTCTAATGCATTCAGGGGTCCTAGCTCCTCGAGGATG 
CCACTGTGCCGTGAACTTCTTTCTGACCCTCATGGCTCCCAGCACA 
GCATCCACACTCAGAAGTGCAAGATGAATGTTTGCAGATAATGAAC 
ATAAAGCTCTCAGGAACCCTCATCTCCTGAGAATCTGCTTTGGCCC 
CCACAGCAGGTCTGGGTGTGGACCTTCCCCA 


SEQ ID 
NO. 216 


KCP_204 2 
42 


GTGGCAAAGTTGGGATCTTAACCCAGTTCTATGTGGCTATAAAGTT 
CATGGAATAGAATGCTGCAGTTAAGAACATGGGCTTTGGCATCAAG 
CAGACCTGTATTTGAGCCCCACCTCTGCTGTTTATTAACTGTGGCC 


SEQ ID 
NO. 217 



2345.2049-005 



-172- 





CTGGGCAGATGACCTTACATCCTTAAGTCTCTAGTTCTTTGTCTTT 
AAAAGGGTGGCAGAATGTACCTCACTGGTTTTAGGAAGGTCACATG 
AGATAGTGCACATGAAGCCCTAGGCATGGGAAAATTCTTCTAAAAT 
GTCAGCTGCCATTCTGATCACTGCAAGACCCCCACCCCCAATACTC 
CCAATTGTACCACCCCACCCCACTCACCAGTGTCTCAGAAATGCCT 
CCTCCAGAAGGAAGGCATCCTGTCTAACCCACTGCTTCTAGCCAAG 
CTGTCTTTCTTCAGAAGGTAGAAAAAGATTGTTAGTCATTGTTTAA 
TCTTTATTGAGTATATACCGCCACACCAATTGCACTGCCA [C/T] T 
CATTATCTCATTTAAATCTGACAAGAGCCTTGTAAAGTAGGGATTA 
TTCCCACCATTTCCCAGATGTTGAAACTGAAATTGATAAACACGAC 
ATGTTGCCATGGCTACATGAAGATCTCCAAGCCGGAGGATCTCCAC 
CCTCACCTGCCTAGCTTCCCAGACCTCTCTGCAGAAAAGGGACTGA 
CCCCCAAGACAGCCCTGGCCTCTGGGCTCCACCCCTTCCACATCCA 
TCCCAGGGCCGCTGAGGACTGAAGAGTTCTCCACGTTTGCCCTTTA 
AAGTGACTTAAAAATAATCTTTATGAATTTCTTCATATACAAAATT 
TGTACTTACTCATTGCAGCAAATTTAGAAAATACACATAAGCAAAA 
AAGAACGTAACAGCCATCCATAACCCTAACTCTCAGAGATCACCAC 
TATTAAAATGTTTATTATCTAAGAGAGAGATGATATAGACAAAGAT 
GAGACAGATTGACACAGAGAAGATGGGTACATGATAGAT 




KCP 2062 
67 


TGCTCAACTGTAATCAAACATTATTTTTAAAAAATCATTCCAGCCT 
GGGAAACAGTGAGAAAC CCATCTCTACAAAATAAAAAATAAAAATT 
AGCTTGGCATTGTGGCATCTGCCCGTGGTCCCTGCTACTCAGGAGG 
CTGAGATGAAAGGATCACTTGAGCCTGGGAGGTTGGGGCTGTGGTA 
AGCCGTGATTGCCCCATTGCACTCCATTCTAGGCAACAGAGTGAGA 
CCCTGTCTCAAAAAAAATATTATTCATTTAATATCTGTTGCCACCA 
CAGGACTGATCCCTCTGTGAGGGCAGAGATTGTTCATGCATGGAAT 
TGTGATTTATAAGCACTGGCTCTGGAGCCAGGTTGCCTGAGCACGG 
AGCCAGCTGTGCCCTGCGGGACACCTGTGGCACACTTCACTCCTGG 
GACACCTGGGACACGCACACAATAGAAATGTTCACATTTTACTAGG 
CAATGCCAGTCACATAGTCCTACCTAATTTCAAAAGGGTA [A/G] A 
AGGTACACCCAACACGCATCAGGAAGGAGGAGGACCAGAAATTGTT 
GGTGACAAGCACAAATGACCACCCCAATATAATATTTTGTTTGGAA 
GGCATTTTATTCCACAAAAACAACATTACAATAAACACAACAACAA 
AACACTGGTTGCAGTAGAACCAACTTTCCAGACCTATCTGCACAGC 
ACAACCATTATCCCACTCAAAATGTCATGTTTTTACCCAAAACATT 
AAAATTTTAAAAGCAATTCAAACCCATAGCTTAAAAAATGTTCCAA 
CCAGTAATAAAAGGAAAAGTGTGCCTCCTCCTCCCAACTTCCCTAC 
CCCACAATCGCAAGATATTATCCTTATAGGCGAAAAGGGTTTCAGG 
ATTTGAGATGCAGGCTGGGAGGTCTGAGAAGACTTCCTATAGAAGA 
CATGACTTCAAACTCTTTCTTGTATGTGAGATTTAATTTTCA7yVGA 
CTCCTCTGATCCAACTTAAGCTTTATGGTAAATCACCTT 


SEQ ID 
NO. 218 


KCP 2076 
61 


ATATGCCAGCGCTCTATCTGCAGGGGTTCTTTTGATAGCAGCAGAC 
TGAGAGATGATGTTACTGTCCCCTTTTTCCTGTTGTTGGCAACTGA 
GACTCAGAGGATGGAAGTGACTTGCTCAGGTCCACCACCTCTTCAG 
CTGTGGAGCTGCGACAGGAGCCTTTGTTTGACTTCAAAGCTCACCA 
TCACTCCTCTCTCACTGATGCTCAAGTGGGCTATCACCTCGCCTTT 
CCTGAGCCTTCCTTCGCTATCCTAAAACAGCGCCTCCCGAAATCAC 
CACTAAAGAACTTATTCATGTAACCAAACACCAGCGGTTCCCCTAA 
AAACCTATGGAAATAAAAATTAAAAATAAAAACAGTGCCTCCCATG 
ACCCATGTCTCTCCAGTCCCATAACTCTGCTCTATTTCCATTCACA 
GCTCCATCCCCACCTTTATGTCTTTTGTTCACTGCTTTATCCCCAG 
TGCCTAGAAGAGTGCTTGGCACCTAGTAGACACTCAGTAA [C/G] T 
ATTTGTCGAATGAGTTAATAAGGTTGTGAAAAGAACGTTAGATTAC 
TGGAAGGATTCATCTGAGTTTAATTCTGCTATGCTGGGAATCCAGT 
GTGCGGCCTTGGATGAAGCCAGTTCCCTCCCTGGGCCCCAGTAGCC 
ACATCTGTACATTTAGAGGGCAGGAGAAAAGCCACACGCTCTGTGA 
CTTATACAACTTGTTGCCCAGAGTGGAGGCTGCTTTGATGCTCAGA 


SEQ ID 
NO. 219 



2345.2049-005 



-173- 





AAAAAGAAACAAACATGGAAATGCTAAATGGGTGGCAGAGAGCTTG 
AGGGAGGAAGGAGATGGGGAGGGTACTCTTGAAACTGTTTGGTGTC 
TTCCCTCCTGCCCCCTCAGTACCAATTGTCAAGTACAGAAAGTGAA 
GGAGACTTGTATTAGTGGAATTTGGTCCCTGACTTGTTATAGAGAC 
ACAATTACAAAGACACAAGAGTGGGCCCAGCAGAGACCCTTAGGGT 
GGTCCCTTGAGGTTCCAAAGCATCTGCCCATCAAGCAGA 




KCP 2079 
65 


CACCAGCGGTTCCCCTAAAAACCTATGGAAATAAAAATTAAAAATA 
AAAACAGTGCCTCCCATGACCCATGTCTCTCCAGTCCCATAACTCT 
GCTCTATTTCCATTCACAGCTCCATCCCCACCTTTATGTCTTTTGT 
TCACTGCTTTATCCCCAGTGCCTAGAAGAGTGCTTGGCACCTAGTA 
GACACTCAGTAAGTATTTGTCGAATGAGTTAATAAGGTTGTGAAAA 
GAACGTTAGATTACTGGAAGGATTCATCTGAGTTTAATTCTGCTAT 
GCTGGGAATCCAGTGTGCGGCCTTGGATGAAGCCAGTTCCCTCCCT 
GGGCCCCAGTAGCCACATCTGTACATTTAGAGGGCAGGAGAAAAGC 
CACACGCTCTGTGACTTATACAACTTGTTGCCCAGAGTGGAGGCTG 
CTTTGATGCTCAGAAAAAAGAAACAAACATGGAAATGCTAAATGGG 
TGGCAGAGAGCTTGAGGGAGGAAGGAGATGGGGAGGGTAC [C/T] C 
TTGAAACTGTTTGGTGTCTTCCCTCCTGCCCCCTCAGTACCAATTG 
TCAAGTACAGAAAGTGAAGGAGACTTGTATTAGTGGAATTTGGTCC 
CTGACTTGTTATAGAGACACAATTACAAAGACACAAGAGTGGGCCC 
AGCAGAGACCCTTAGGGTGGTCCCTTGAGGTTCCAAAGCATCTGCC 
CATCAAGCAGATGATGTGATTAGTCTCTGTGACCCCAAGGATGCCT 
CCTGAAATTGCTGATTCAATTTCTCCTAATAAAATAGGAACAATAA 
TTAGCTAATAAGAAATCAACAATTAAAGCTATGAGAGAATTAAGTG 
AGATCATGTAAGCAAAGTACATGTCACAGTGCTCTGCAAATAGGCA 
GTGCTCAGAAGTGTCACCTTTTCTCTTTCTTCTCTGAGCCTCCGTC 
TTCTCTTCGGTAAAATGAGAATAATATTATGCATACCTCACAGGGG 
TTAAGCAATGTGAAAGTACTCTGTAAAGTATAAGGCTGA 


SEQ ID 
NO. 220 


KCP 2115 
25 


GAGATGATCAACAGTCTTTCATCCAGAGGGTTGTGTTTGCTGGTGG 
CCATTACCTTTAACATAAAACGATCATATTTACTTTATCCTATTCA 
TGTCCAACCTCAACTGACAATTGAGTTGTGTCTCTGACAATAAATA 
GCAGAAAAAGGAAATCTTCCTATACTGAAGAGAAACACAATTAATT 
AACTAGATCCATCAGGAAAGGTACAATCATGATTGAGACAGTGTTT 
AACAGATGTGACTATTGGATTCTGTTGTTGAGAATGACCCTTAAAA 
TCACAGTCAAAATATACGACAAGATGGAAATAACATTTTTGAGCAC 
CTACTATGCATGTAGAGCATCTTACATACCTTATCTCACTTAGATT 
TACAGCTGCAAGGTGGGTATGATTCTAGCTTGAATTAGTCTAATAA 
CCATATACCTCCTAGGGGCAGTGAGATGATTAGATCAATTCTAAAA 
C TATTAC CATGCTCTCTGAGCTCACCAAGACAGGCAGTTA [A/G] T 
ACAAGGATACATTAATACCGAATCCAGCAAAAGCTCACATGGCCAG 
CTTCCATTATGTTCCTATTTGTGATTATTCTGTATCAAGCACAGAA 
ATGTATGTTCACACGAACAACAAAGAAGGGGTTTATTAGTGTGGAT 
TACAGGGCCTAAGCCTACCCTCTGAAACTGGTTTTGGAGTCTTTAG 
CACGCTTGTTTGGGACAGTTAAACATGTGCCAGCTATTCTAAAACA 
GTAGCAGTAATGTGATAGAGCTGGGTCATACCGTGCTTCCCAAAGT 
ATGATCACTTCATTTCAACAACTTCACACTAACAGCCTGAACTGGG 
CTGTGAAGGGAATATTTAGACCAAGGAAACTGGAAAACTGTATCAA 
TCAGGCTTTTCCACCCTCCCCAAGAGCCAGTTGTCAGATATCTACC 
AGCCTACCAACGCTAGCTCTCTAATCAGAAACCATCACTTAGCAAG 
TTCCCAAATTATCTGCAGAGCAATGAACTCCTCTTCTTC 


SEQ ID 
NO. 221 


KCP 2118 
50 


CTATGCATGTAGAGCATCTTACATACCTTATCTCACTTAGATTTAC 
AGCTGCAAGGTGGGTATGATTCTAGCTTGAATTAGTCTAATAACCA 
TATACCTCCTAGGGGCAGTGAGATGATTAGATCAATTCTAAAACTA 
TTACCATGCTCTCTGAGCTCACCAAGACAGGCAGTTAATACAAGGA 
TACATTAATACCGAATCCAGCAAAAGCTCACATGGCCAGCTTCCAT 
TATGTTCCTATTTGTGATTATTCTGTATCAAGCACAGAAATGTATG 
TTCACACGAACAACAAAGAAGGGGTTTATTAGTGTGGATTACAGGG 


SEQ ID 

NO. 222 



-174- 





CCTAAGCCTACCCTCTGAAACTGGTTTTGGAGTCTTTAGCACGCTT 
GTTTGGGACAGTTAAACATGTGCCAGCTATTCTAAAACAGTAGCAG 
TAATGTGATAGAGCTGGGTCATACCGTGCTTCCCAAAGTATGATCA 
CTTCATTTCAACAACTTCACACTAACAGCCTGAACTGGGC [C/T] G 
TGAAGGGAATATTTAGACCAAGGAAACTGGAAAACTGTATCAATCA 
GGCTTTTCCACCCTCCCCAAGAGCCAGTTGTCAGATATCTACCAGC 
CTACCAACGCTAGCTCTCTAATCAGAAACCATCACTTAGCAAGTTC 
CCAAATTATCTGCAGAGCAATGAACTCCTCTTCTTCAGAAAGCAGG 
CTGAAAGATACACTGTTCACATCTTAGCCTGACCTGGACCCAGTGA 
GTTTCCATCAGTGAGAAAATTCTGTGCTAACTTGAGATAATACTAT 
TCTTGTGGCAATTTTACTTTTCCTTTGAGCGATTCCTTCAACCTCT 
CTCTGCCCCTTCATTTTTCCGTCTTAAAACTAT^AAGTGCCCTTTCT 
CCCTGGACACTCCTCATTTGCAATGAATTGTCATTTCAGCTCCTCA 
GTCAAGAGGAGTAATGAAATCCCACCCGTGTTAATCCTCTTATATC 
CCGCAGAAATATTGTAGACCCACTCACCCTAGGCAACAT 




KCP_2127 
75 


AGTAATGAAATCCCACCCGTGTTAATCCTCTTATATCCCGCAGAAA 
TATTGTAGACCCACTCACCCTAGGCAACATGCCCTCTCTCTTCAAC 
ACAGGTCATCAATTGTTCATTTACTGGCTATCTCCATGTACTGGAA 
CTTCAGGGTGGTGTCCAGCTGGGTTCAAAGGAGAAACAGTGGGAAG 
TTTCTCCACTGCCACCTGAATTAGATGAGAAAGAGTTGTCTACTGA 
AATACACTAGCTGGTGGCAGGATTGGGACGTCATTTGACTAATTGC 
CTCCTAGAGCTGCAGAGACTGCTGGAACTACCTAAGTAAATCATCA 
AAAAAAAAAAAAAAAAAAATCATCCCAGGGCACTTTTTCCAGACAA 
AAAGGTCCACTTAAAACATCCTCTAGAGATCTGTGCCTGAAGCTGA 
GCTGCTGCAATGAAACTGACATTTCTGCCTTGCAGCCTGGCCATGG 
GCTTAGCTGGACTAAAATGCTGCTGCAGTGGTGAGGGCAC [A/G] T 
GAGAGTCCCTAATGTACATGGCCTTGCTCCTTGTCCTGACACATCT 
TTTAGGGCTGCTGCTTTCTCTAGTGCTGGAATCTAGATAATTCCTT 
TCCCAGCCGTTTGTTTCTTCAATCTTGGAAAATATCTGGATGAATG 
TAACACTGTCACACACAAACAGAATTATGACTTACGTCACATTCTA 
TGTCGTGATTTTGTGGACTTTTAATAATTGCATTACATTTGTGACC 
ATTAATTTCCACCATCGCCCTGCTCCTGAGAATCTGTAAGGGACAT 
TTGACACTCCTCTCCCCACCCACCTCAACATTTGTGCTGACCTGAA 
GGTCACATTAAAAACATACCCATTTGGAGAGAAAGATCTGTCTACT 
GAAATACACTAAATATTGAAGAATTTCCAAGTCATTTGATCTTGAA 
AACTCCATCTAATGGAAGCAGAAACACTCAAAGGTTTTTTTTTTTG 
GACTCCCTTTTTCAGGACACTTTCAGGACTGAGGTATAT 


SEQ ID 
NO. 223 


KCP 2217 
99 


TCAGACTTTGAACAAACCTCAGAAGGAACTGTCAAGGAGGCTCCCC 
ACGGGTTCACGCTCTTCTCTCCTCCTGCACACAGGGAACAGGGCCA 
TTCTCCTTCCTTTACTGGGACTACCTGGGCTTCATCCAGGGAATCC 
CCAGGTGGCAACAGGAGGGTGGTGAAAACCGCTGCCCGTCACCTGT 
AAAGTTTCCTGTGAATGTGTCTACAGCGGCCAGCACCACAAGGCAT 
ACAAAGAAAGGGAAGGGAGAGCTGATGTGAGAGCGGCAGCGTGGGC 
ACTCCTGTGAGGTTGCCACAGCTGTAGACAAGTTAAATCAGTGCAG 
TTCAATCAAAAGTCATGACCCATGAGCGTCACAACCAGCACGAGTC 
TACAAAGGAATACATTAAAACTAAGACCAGAGCACAGCTCACATTA 
GTGAGGGATGGGATCATTTCATGGAGTTTTTGTTTCAAAATATTTC 
ATTAACATTTCACTTATATACATGTGTGTATACTGGGTTGTGAT [A 
/TJ TAAATTACAATTCTTACTATAAAATACAGCAAAAGAAAGAAGA 
AACAAAGAGAGGGCCACTGGTTTACCTAACATCCACAGGCAGGCTA 
CTTCCCAGCATCTTGAGCCCCAAAGAAGTAAATTTCCTTCCACAAC 
CGATGTTACCACAGCCTGACACTTAGCCAATGATGAAAACGAAAAA 
CAAAACAAAAGCTTGGCAGTCAGTATCCAAATATGCAGATACTACA 
GAATCTGTTTGATGTAGAAGTTGATCCTGCTACCCAGACAGCAAAC 
AACTCATTTATTAATAAAGTCCAGTTCCTCCTTAATGAAGTGGGTT 
TAATAGTTGATATCTCAATAATTACTTAGTGCATTTTTTATGAAGG 
TGATGGGAAACAAGTGCTGTTTCTTGAGTCGGAAAGAGTCTCTCAA 


SEQ ID 
NO. 224 



-175- 





GCTCCCACAAAGAAATTTCCCGAGCTTGTGAGGAATTCAGTCACAG 
GAAGATCAAGGAATT 




KCP 2235 
68 


GATCTAATGCTAGGAGATTCAAACCAACAATTAATTTCTCTGTTAA 

AATGGGTTAAAATAGATGTAAAATATTAATATGTATATAAGCATTC 

TGAATTAGACTTATGTGAATTTTTCTCCTTTTCTTTCTTTCTTTTT 

GAGAATAAGCCCTTTCATTTACGTAGAAATGCTTCAGCGTTTAGAT 

AATTGCTACTTATCTTGTTAGCTACAAACACAACCATAATTAAAGG 

CTCTGTAAGAATTATGAATTCTGGGGAAATTGGCCACTTGTCTCTG 

TGGCGTAAACAGTATCTAATTTATAACAAATCATCTGCCTTAGTCC 

CAGCAGGATAAGGTGATATGTATTGCCCAGCACATGAGAAAGATGG 

CAATTAGGAATTGTTACCAAGTTACGGGAGCCTCACACGAACATCC 

ATCACCTTTGGGGATATGTACAAGATACAAACTTAATTTGATGGAT 

TCCTTTTGTATTGGGATCAAAGTCTCAAAAGGGAAAGTGACAATTT 

CAGGGAAAATCTGGTGCAATGAGACCAACACTGATGAGAGAAATGC 

ACACAATTTAATACACCTGCTCACCTGATGTGGCAACTCAGCCTGT 

GCTTGCTGTGGGTTGCCACAGGATGAGACATGGTCTGTGCATATTC 

CCAGCAGCCACCCATCTCATCACTATTCTTGCCAGCCCAGATTTAC 

AGTTGTTCAATAGATGGATTTGGTAATATCTGCATGACAACAACAG 

GCAGAGAAGGTTAGATGGCAATTGATTCTTGATTGGTGTAAGTTTA 

TAGAACACATTCTGGCAGGGCCCAAAGGAAATCACTCACCTACCCC 

TCTGTGATGGTAAAACGTTGAAAATTCCACGGACTTGGACCTTGTG 

ATCCTTCAGTGGAAGATGGGCAGATTCCTTGCTTTAATTGACAGAC 

ACTTTCTAAATAACTAATGCAATCTTATATTACATTATAGTCCATA 

AGGGAGACATACTTAAACTACTACTTACAACAAC [G/T] GTTTTTA 

GAGCCTTTCAAATGGTTTGTACAAAGTAGCTCCCATTTAAGATATT 

TTCCTAGTATTTAAGGCTATCTAGTAGACATTACAAAACAATACGC 

TGTAAATACATTCAGATTTTTATCAGTAATACTTAACATGCCGTAA 

TTTGAACTTTCTGCTAAATCATGCTATCCATTCCTAGTTGGCCCCA 

ATGGTGAGAGTTTACTGTTTCTTTAAATAATTTTGTTTCCCTTTGC 

TGTCTAGAGGTGTTTATCATTCTGCTTACTTGCCTGTGTCTCTGGA 

ATATTCAGAAGGTTCCATGGGAAACAATTTGAATATGCAAAGAAGT 

TATTTTTAAAGCAAGGAAAATGTTTTCATATGGATTTATTTTGAGC 

ACTTCTGCCTTTGCCTCCACTGGGAACATGTTTCTCTCCAACGCCG 

AAGCCCCCTCCCTGTGTGGTGTTTGACGCAGAGGCTGACAGGGCAG 

GGAAGTGGGGTTCAAGATAGGAAGGCCATTGGCAGTGTGACCCCAG 

CCCACAGTCCTAGATCCCAGGTCGTGACACCACTCTTTTGACAGCC 

CAGATTGTTACCTAACAAGAATGACTCCCAAGCTCAACCATTCCAA 

TGCCATCTCCTCTGGTTCCAGATAAGATTGAAGATGAGCTGGAGAT 

GACCATGGTTTGCCATCGGCCCGAGGGACTGGAGCAGCTCGAGGCC 

CAGACCAACTTCACCAAGAGGGAGCTGCAGGTCCTTTATCGAGGCT 

TCAAAAATGTAAGACCCGTGCACGCTCTGAAGGCCTGGGGGGGGTT 

CCCACGTGAGGCTACACTCTCCCCAATGCCAAGGGAGCTCATAAGG 

CGTTTCCCATATGTGAGGCTGTACAAGGAAGGCCAGCTCTATAAAG 

GGGGCATGAGAGGGAGATCACCTGGCTAGAAAGGAAGGCTCCAGGC 

GAGGATGGAGCAACCTCAGGAGACAGTAAACGGCCAACTGCCCAGA 

AATTTCACAGGGTGGCACATCCTCAAG 


SEQID 
NO. 225 


KCP_1152 


GATTTTTATCAGTAATACTTAACATGCCGTAATTTGAACTTTCTGC 
TAAATCATGCTATCCATTCCTAGTTGGCCCCAATGGTGAGAGTTTA 
CTGTTTCTTTAAATAATTTTGTTTCCCTTTGCTGTCTAGAGGTGTT 
TATCATTCTGCTTACTTGCCTGTGTCTCTGGAATATTCAGAAGGTT 
CCATGGGAAACAATTTGAATATGCAAAGAAGTTATTTTTAAAGCAA 
GGAAAATGTTTTCATATGGATTTATTTTGAGCACTTCTGCCTTTGC 
CTCCACTGGGAACATGTTTCTCTCCAACGCCGAAGCCCCCTCCCTG 
TGTGGTGTTTGACGCAGAGGCTGACAGGGCAGGGAAGTGGGGTTCA 
AGATAGGAAGGCCATTGGCAGTGTGACCCCAGCCCACAGTCCTAGA 
TCCCAGGTCGTGACACCACTCTTTTGACAGCCCAGATTGTTACCTA 
ACAAGAATGACTCCCAAGCTCAACCATTCCAATGCCATCT [C/T] C 


SEQ ID 
NO. 226 



2345.2049-005 



-176- 





TCTGGTTCCAGATAAGATTGAAGATGAGCTGGAGATGACCATGGTT 
TGCCATCGGCCCGAGGGACTGGAGCAGCTCGAGGCCCAGACCAACT 
TCACCAAGAGGGAGCTGCAGGTCCTTTATCGAGGCTTCAAAAATGT 
AAGACCCGTGCACGCTCTGAAGGCCTGGGGGGGGTTCCCACGTGAG 
GCTACACTCTCCCCAATGCCAAGGGAGCTCATAAGGCGTTTCCCAT 
ATGTGAGGCTGTACAAGGAAGGCCAGCTCTATAAAGGGGGCATGAG 
AGGGAGATCACCTGGCTAGAAAGGAAGGCTCCAGGCGAGGATGGAG 
CAACCTCAGGAGACAGTAAACGGCCAACTGCCCAGAAATTTCACAG 
GGTGGCACATCCTCAAGGAATTCACCCTGGCCCAGGGTCAAGCCTT 
AGCCCTTAACATAATCATACCTTCCAACCTGGTGGTGCCCCCACAA 
TAATGGGATTTGGCCCTGCTGACTTATGCTAACCAGGCT 




KCP_1333 


GGCAGGGCCCAAAGGAAATCACTCACCTACCCCTCTGTGATGGTAA 

AACGTTGAAAATTCCACGGACTTGGACCTTGTGATCCTTCAGTGGA 

AGATGGGCAGATTCCTTGCTTTAATTGACAGACACTTTCTAAATAA 

CTAATGCAATCTTATATTACATTATAGTCCATAAGGGAGACATACT 

TAAACTACTACTTACAACAACTGTTTTTAGAGCCTTTCAAATGGTT 

TGTACAAAGTAGCTCCCATTTAAGATATTTTCCTAGTATTTAAGGC 

TATCTAGTAGACATTACAAAACAATACGCTGTAAATACATTCAGAT 

TTTTATCAGTAATACTTT^ACATGCCGTAATTTGAACTTTCTGCTAA 

ATCATGCTATCCATTCCTAGTTGGCCCCAATGGTGAGAGTTTACTG 

TTTCTTTAAATAATTTTGTTTCCCTTTGCTGTCTAGAGGTGTTTAT 

CATTCTGCTTACTTGCCTGTGTCTCTGGAATATTCAGAAGGTTCCA 

TGGGAAACAATTTGAATATGCAAAGAAGTTATTTTTAAAGCAAGGA 

AAATGTTTTCATATGGATTTATTTTGAGCACTTCTGCCTTTGCCTC 

CACTGGGAACATGTTTCTCTCCAACGCCGAAGCCCCCTCCCTGTGT 

GGTGTTTGACGCAGAGGCTGACAGGGCAGGGAAGTGGGGTTCAAGA 

TAGGAAGGCCATTGGCAGTGTGACCCCAGCCCACAGTCCTAGATCC 

CAGGTCGTGACACCACTCTTTTGACAGCCCAGATTGTTACCTAACA 

AGAATGACTCCCAAGCTCAACCATTCCAATGCCATCTCCTCTGGTT 

CCAGATAAGATTGAAGATGAGCTGGAGATGACCATGGTTTGCCATC 

GGCCCGAGGGACTGGAGCAGCTCGAGGCCCAGACCAACTTCACCAA 

GAGGGAGCTGCAGGTCCTTTATCGAGGCTTCAAAAATGTAAGACCC 

GTGCACGCTCTGAAGGCCTGGGGGGGGTTCCCAC [A/G] TGAGGCT 

ACACTCTCCCCAATGCCAAGGGAGCTCATAAGGCGTTTCCCATATG 

TGAGGCTGTACAAGGAAGGCCAGCTCTATAAAGGGGGCATGAGAGG 

GAGATCACCTGGCTAGAAAGGAAGGCTCCAGGCGAGGATGGAGCAA 

CCTCAGGAGACAGTAAACGGCCAACTGCCCAGAAATTTCACAGGGT 

GGCACATCCTCAAGGAATTCACCCTGGCCCAGGGTCAAGCCTTAGC 

CCTTAACATAATCATACCTTCCAACCTGGTGGTGCCCCCACAATAA 

TGGGATTTGGCCCTGCTGACTTATGCTAACCAGGCTCACCGAGACT 

GATGTGTAAGCCGAATGTCGGTGTATTAATTTACCTTGGGAAATGG 

AACTGACAGTGGAAACAGACACTCCTCTCCCTTCGCTGGGACCCGC 

TCTCCTTGGAAGCCACATGGAAGCCAGGTTACAATCAAAAGTGGAG 

TCAGAGGACGGGAGTTCCTTGTTTAGTTGTTACTTTAAATACATTA 

ATGTGTTCCTGCAGTCTCAGGCCAGTTTGAGAGCTCTCAGATACAA 

TCCTGGATATTAATTTATTTTTTAAGTTTAACTCTCAGAGTGCAAT 

CTTATTCCCAAATCCTGGAGTGGTGTGGAGTGGGGTGGGCTACAGC 

GACATGCACCTGGTCACCCTCCCTCCAGGTGCAGTCTGTAGGTAGA 

GCTGAGCTGGGTCAGTTCCAAACTGACCACAGCCTCAATGTTCTCC 

AAACTGCTGACCCACAGGGATTCCAGCCCCTCCTGGGAGTTATCTG 

ACAGGTGCTGGGATGCCTCTTCCTTCCACACTAGCCTTGACTGCAC 

ATGCCAAGTGCCCAGTTTCCTACCATTAGGGCTTCTTTCCTTCGAT 

GGCAGCATTAGCAGTGGGCAGCCGAGTTGGAGAAGGATCCTGTGGG 

AAAGTTTTCCAGGCAGGCACTGGGCTCAGAGGGAACAGCATCCAGA 

AAAGAGAAGAAATCTACACTGCTTGGC 


SEQ ID 
NO. 227 


KCP 2252 
20 


AATTTACCTTGGGAAATGGAACTGACAGTGGAAACAGACACTCCTC 
TCCCTTCGCTGGGACCCGCTCTCCTTGGAAGCCACATGGAAGCCAG 


SEQ ID 

NO. 228 



-177- 





GTTACAATCAAAAGTGGAGTCAGAGGACGGGAGTTCCTTGTTTAGT 
TGTTACTTTAAATACATTAATGTGTTCCTGCAGTCTCAGGCCAGTT 
TGAGAGCTCTCAGATACAATCCTGGATATTAATTTATTTTTTAAGT 
TTAACTCTCAGAGTGCAATCTTATTCCCAAATCCTGGAGTGGTGTG 
GAGTGGGGTGGGCTACAGCGACATGCACCTGGTCACCCTCCCTCCA 
GGTGCAGTCTGTAGGTAGAGCTGAGCTGGGTCAGTTCCAAACTGAC 
CACAGCCTCAATGTTCTCCAAACTGCTGACCCACAGGGATTCCAGC 
CCCTCCTGGGAGTTATCTGACAGGTGCTGGGATGCCTCTTCCTTCC 
ACACTAGCCTTGACTGCACATGCCAAGTGCCCAGTTTCCT [A/G] C 
CATTAGGGCTTCTTTCCTTCGATGGCAGCATTAGCAGTGGGCAGCC 
GAGTTGGAGAAGGATCCTGTGGGAAAGTTTTCCAGGCAGGCACTGG 
GCTCAGAGGGAACAGCATCCAGAAAAGAGAAGAAATCTACACTGCT 
TGGCATCTACCATGGACTCAATACCACCTAACATAGGTTCATAAGA 
TACCCTTGGGGAAGTTATTGTTACCCCCATTTTACAGGTAAGGATA 
TTGAGGATCAGAGACTGGCTTGGCCAAAGTCACAAAGCTTAGTATT 
GGCTGAGCCAGGATTTAAACCCAGGTTTTTCTGATCTTAAAGCCCC 
AAATCTCTCCACCTCACAGTGCCCATTCTCTGACAATGTCTCATCA 
TTTTGCAAAGCAGCTCCAGTCCTGAGATGGCACTACTTGGGAGAAG 
TGGAAATGCACAGGTCCCTGTCCCTGGGGATCATGAGGAACCCCAG 
ACACCAAGGCTGGGCCCAGTCTTCTCCTAGTGCTGGCCC 




KCP_2649 


GGCTCACCGAGACTGATGTGTAAGCCGAATGTCGGTGTATTAATTT 

ACCTTGGGAAATGGAACTGACAGTGGAAACAGACACTCCTCTCCCT 

TCGCTGGGACCCGCTCTCCTTGGAAGCCACATGGAAGCCAGGTTAC 

AATCAAAAGTGGAGTCAGAGGACGGGAGTTCCTTGTTTAGTTGTTA 

CTTTAAATACATTAATGTGTTCCTGCAGTCTCAGGCCAGTTTGAGA 

GCTCTCAGATACAATCCTGGATATTAATTTATTTTTTAAGTTTAAC 

TCTCAGAGTGCAATCTTATTCCCAAATCCTGGAGTGGTGTGGAGTG 

GGGTGGGCTACAGCGACATGCACCTGGTCACCCTCCCTCCAGGTGC 

AGTCTGTAGGTAGAGCTGAGCTGGGTCAGTTCCAAACTGACCACAG 

CCTCAATGTTCTCCAAACTGCTGACCCACAGGGATTCCAGCCCCTC 

CTGGGAGTTATCTGACAGGTGCTGGGATGCCTCTTCCTTCCACACT 

AGCCTTGACTGCACATGCCAAGTGCCCAGTTTCCTACCATTAGGGC 

TTCTTTCCTTCGATGGCAGCATTAGCAGTGGGCAGCCGAGTTGGAG 

AAGGATCCTGTGGGAAAGTTTTCCAGGCAGGCACTGGGCTCAGAGG 

GAACAGCATCCAGAAAAGAGAAGAAATCTACACTGCTTGGCATCTA 

CCATGGACTCAATACCACCTAACATAGGTTCATAAGATACCCTTGG 

GGAAGTTATTGTTACCCCCATTTTACAGGTAAGGATATTGAGGATC 

AGAGACTGGCTTGGCCAAAGTCACAAAGCTTAGTATTGGCTGAGCC 

AGGATTTAAACCCAGGTTTTTCTGATCTTAAAGCCCCAAATCTCTC 

CACCTCACAGTGCCCATTCTCTGACAATGTCTCATCATTTTGCAAA 

GCAGCTCCAGTCCTGAGATGGCACTACTTGGGAGAAGTGGAAATGC 

ACAGGTC C C TGTCCCTGGGGATCATGAGGAACC C [C/TJ AG AC AC C 

AAGGCTGGGCCCAGTCTTCTCCTAGTGCTGGCCCTCAAATGCCTCC 

CGCTGACTCTCTCCCCTTCCCACAGGAGTGCCCCAGTGGTGTGGTC 

AACGAAGACACATTCAAGCAGATCTATGCTCAGTTTTTCCCTCATG 

GAGGTGAGTCTGACCTTGAAATCTATCTTGCCCAGCTCCCTCTCTG 

GTAAGCAGCCTTCCCTTCCTCCAAGTCCTCTCTTCCTTGCCATTTG 

CTTCCTTCTCGAGGAAGAGACAAACTCAGGGCAGGACACCTCCCTC 

ATCGTGAGAGGTGGGAGTCTCCAAAGCTTTAGCAGGAAAGAACTCT 

GAAAATGAACCCACCCTGGAAGGGGAAGAAGGGCTGATAATGCAAC 

ATCACAACGTCTCAGAACAGCTCTAGAAAGCAGGTATTATAATCCC 

AGATGGAGTAACTGAGTTTCGGGGAAGATAAGCAGTGTACTCAAGA 

TTGCACAGCTGGTGAGTAGCAAACCAGGATTAGATTCCATAAGGGT | 

CTGAAACAGGTTTTGCCATGCTGGCACCACCATTGTGCAGGGCACT 

TTTGAATCTTTTCCTTAAAATAGCTGAGACAAGCTGGAATTTTGTA 

AAAGAACTTCAGTAAATACCGAAGACTATAAAAATAAACTAATTGA 

AAAAGAGGCAGGAAACATAAAGTTGTGCTTATTAAGCCAGTTTACA 


SEQ ID 

NO. 229 



2345.2049-005 



-178- 





AGTGTGCCAGGCCCACAACAGCTGCTCTGTTGCCCTGCCCGACTCC 
TGTGGGAAC CAGCTGTGTCCC C ATGGGC CTGGG AC C ACATCGGTGA 
CTCCTCCTGTGGCCTCCATGTGTCACATGCCACTTTGCATCCTGTC 
ACCAAGAGCTGTCTCCTGCAAGACATCTTCCCTGGATCCTGACAAA 
ATGCAAATCCAAGTATTCCAAACACTTCTTGGGCCCTGTTTCTCAT 
GGGCCTTTTTGGCAGCAGACAGATGCCTTCCTTGGTGTGTGGGGCC 
CCTACCCAGATCAGGTGGGGGAGGCAG 




KCP 2278 
71 


CCTCTGGTTCTGCATCACCTCCCCCTCTAAATCTCAAGGCATTGGG 
GGAAGGTCTGGACCATCAAAAGCTCTCAGTCAGACCAAAGACATGT 
TTATC CATTTGTAAGCATTTC CTAAAGATGGGGAAAAGCAGCAGC A 
ACTTTCCCTGGCCTGCAGGAACTCAGGGACTCAGGGGACTAATAAC 
AACAGTGTATGAGCTTCCGGGCACACTGCTTCCCAGTGGCAGCCCC 
TGTACTTAGGGCTTTGTATGTATTAATTCATTTACTCCAATTCCCA 
CAATAACCCTATAGGGTAGGGTTTTATTATTGATTACCTTTTTACA 
GAAGAGGAGAGTAAGGCAAAGAGAGATAGAGTAGTTTTCCCAAGGT 
CAAAGAGCACATAAATGATAAAGGATGGATTTGAATGTAGGCAGAA 
TGACCCTCAATACAGACTGTTCCTACAGTCCACGTCCTCAGCCACT 
AGACCATACGGCCACTGGGATGATAGACAGACCACTGCAG [C/G] C 
ATGGATAAGGCAAAAACAGGGCTGGCTGTGTTGATCTGTGTCTCTC 
AGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCAAAAAAAAACAAA 
AAAATGGGGCAGGGTAGGGAACTGAAGGCAGGAGCTCTTCACAGAG 
CATAGCCACATCCTCCAGGCAGACAAGAGGACGCAGGAGGCACCAT 
TCTGTGAGAGTATCACAGTCTGACCCAAAGACACAGCTTCACACTG 
TCTGATGGCTTGATGGTTAATGTCACTCTGCCTTTTCCCCTTCTCA 
GGACTTTGTAACCGCTCTGTCGATTTTATTGAGAGGAACTGTCCAC 
GAGAAACTAAGGTGGACATTTAATTTGTATGACATCAACAAGGACG 
GATACATAAACAAAGAGGTAAGTGAGCTGGGGCCAGGGGTGTGAGA 
GGGCTCCAGTGAAGGTAACTAACCCAACAGAAAACAGCCCCAGGCA 
TGAGGATAGCACTGTCTGAATGAGGCAGGCTCTGCTTTG 


SEQ ID 
NO. 230 


KCP 2279 
87 


TGTGCCATTCATACACCAACGACTCCATGCATAGACAGGCAGGAGA 
ATGGTTTTCTCATGATGGCTAGAGGGAGGGGCAAGGGCTCATCTCA 
CTTTTTGCTAGATCTAACTTCACACCCAAACCCAAAGAGTTGAGTC 
AATGGGCCCCACTCCATAATTTTCTCCTTTCCATCACCCTAGCATC 
ACTCTCCTCTCTTTCTTGTCGAAGCCCTGCCTTGTTTGGAAGGTTC 
TCCCTGTGTGGAATTCCTGCCCCCATCACCTGCCCTCCTTTTCTGC 
CTTGTAGATGCCAGCACGTATGCCCATTACCTCTTCAATGCCTTCG 
ACACCACTCAGACAGGCTCCGTGAAGTTCGAGGTACGCTCATCTGG 
GGTCCACTCTAGGGGTCCTCTGGTTCTGCATCACCTCCCCCTCTAA 
ATCTCAAGGCATTGGGGGAAGGTCTGGACCATCAAAAGCTCTCAGT 
CAGACCAAAGACATGTTTATCCATTTGTAAGCATTTCCTAAAGATG 
GGGAAAAGCAGCAGCAACTTTCCCTGGCCTGCAGGAACTCAGGGAC 
TCAGGGGACTAATAACAACAGTGTATGAGCTTCCGGGCACACTGCT 
TCCCAGTGGCAGCCCCTGTACTTAGGGCTTTGTATGTATTAATTCA 
TTTACTCCAATTCCCACAATAACCCTATAGGGTAGGGTTTTATTAT 
TGATTACCTTTTTACAGAAGAGGAGAGTAAGGCAAAGAGAGATAGA 
GTAGTTTTCCCAAGGTCAAAGAGCACATAAATGATAAAGGATGGAT 
TTGAATGTAGGCAGAATGACCCTCAATACAGACTGTTCCTACAGTC 
CACGTCCTCAGCCACTAGACCATACGGCCACTGGGATGATAGACAG 
ACCACTGCAGCCATGGATAAGGCAAAAACAGGGCTGGCTGTGTTGA 
TCTGTGTCTCTCAGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCA 
AAAAAAAACAAAAAAATGGGGCAGGGTAGGGAAC [C/T] GAAGGCA 
GGAGCTCTTCACAGAGCATAGCCACATCCTCCAGGCAGACAAGAGG 
ACGCAGGAGGCACCATTCTGTGAGAGTATCACAGTCTGACCCAAAG 
ACACAGCTTCACACTGTCTGATGGCTTGATGGTTAATGTCACTCTG 
CCTTTTCCCCTTCTCAGGACTTTGTAACCGCTCTGTCGATTTTATT 
GAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTTAATTTGTAT 
GACATCAACAAGGACGGATACATAAACAAAGAGGTAAGTGAGCTGG 


SEQ ID 
NO. 231 



-179- 





GGCCAGGGGTGTGAGAGGGCTCCAGTGAAGGTAACTAACCCAACAG 
AAAACAGCCCCAGGCATGAGGATAGCACTGTCTGAATGAGGCAGGC 
TCTGCTTTGGGGCTAACAGAGCTGGTCCCTGGCAAAATAAAGAAGG 
CCTCCCTCATTGCCCTACCCTGCCCTGTTCCCAAGCGCCCAGAAAG 
GATTAAACAGATTCATTCTCACTGGGTCACCTAGATTCAGTAGATA 
TTACACAGTGGATAAAAATGACTTGTTTCAGTGTGAAGAGTTACTC 
TTCCCTAGGGAACCTGCATTTGGGAAGGTTAGGAGCCACAAGTCAA 
AGCTAAAAGTTGAAATGGTGGAATTGTAGGCAGCACCTAGAATAGA 
AAAGAAAGATTTTTAAGGAAGAGGAACCTACAATTGGGTCATATTG 
GCCTTAAACTATTTTGCCTATTAATACAACCGCCAAGGGGGTAATG 
GAAGGTACAGCTGTCTTTACAGAAATTATCACAAATAATTTCTGAA 
TCTTCACTGCTTTGCACTTTTAGAACCTCAGAGGACATGTCTCTAG 
CCAGTGAAATACCCTCAGGTCTATCTCAAAACTCACTTTGGTATCC 
ACTGTATCCTGGTATCTCAGTGGAAGCTGGAAATTGGCATCCTGTA 
ACACTCCACTTGCTGAGCTCCTGTGTGCCAGGCACGGTGCCTGGAG 
GTATAGATATCAGCACCAATCTTCACC 




KCP 2281 
07 


TAGGGCTTTGTATGTATTAATTCATTTACTCCAATTCCCACAATAA 
CCCTATAGGGTAGGGTTTTATTATTGATTACCTTTTTACAGAAGAG 
GAGAGTAAGGCAAAGAGAGATAGAGTAGTTTTCCCAAGGTCAAAGA 
GCACATAAATGATAAAGGATGGATTTGAATGTAGGCAGAATGACCC 
TCAATACAGACTGTTCCTACAGTCCACGTCCTCAGCCACTAGACCA 
TACGGCCACTGGGATGATAGACAGACCACTGCAGCCATGGATAAGG 
CAAAAACAGGGCTGGCTGTGTTGATCTGTGTCTCTCAGAGCTCCAT 
TCTTCCTCAAGGGGGCACCTTGCAAAAAAAAACAAAAAAATGGGGC 
AGGGTAGGGAACTGAAGGCAGGAGCTCTTCACAGAGCATAGCCACA 
TCCTCCAGGCAGACAAGAGGACGCAGGAGGCACCATTCTGTGAGAG 
TATCACAGTCTGACCCAAAGACACAGCTTCACACTGTCTG [A/T] T 
GGCTTGATGGTTAATGTCACTCTGCCTTTTCCCCTTCTCAGGACTT 
TGTAACCGCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAGAAA 
CTAAGGTGGACATTTAATTTGTATGACATCAACAAGGACGGATACA 
TAAACAAAGAGGTAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGCTC 
CAGTGAAGGTAACTAACCCAACAGAAAACAGCCCCAGGCATGAGGA 
TAGCACTGTCTGAATGAGGCAGGCTCTGCTTTGGGGCTAACAGAGC 
TGGTCCCTGGCAAAATAAAGAAGGCCTCCCTCATTGCCCTACCCTG 
CCCTGTTCCCAAGCGCCCAGAAAGGATTAAACAGATTCATTCTCAC 
TGGGTCACCTAGATTCAGTAGATATTACACAGTGGATAAAAATGAC 
TTGTTTCAGTGTGAAGAGTTACTCTTCCCTAGGGAACCTGCATTTG 
GGAAGGTTAGGAGCCACAAGTCAAAGCTAAAAGTTGAAA 


SEQ ID 

NO. 232 


KCP 2325 
21 


ATTTCTTAAAGTAGATAAATTTGACTTTATCAAAGTTAAAAATTTT 
GTGCTTTAGAAGACACCTTTAAGAAAATGGAAATGCAAGCCATGGA 
CTTGGAAAAAATGTTTGCAAATTATATACCAGATATATAAAGATAC 
CAGGATACCAAACCAATATAAAGACTGGCATCCAAAATATATAAGG 
GACATTTATAATTTAATACAAAGATAAACAACTTCATATAAAATAG 
GCAAAAGATTTGATGAGATATTTAAGAAAAGAAGATATATGAATGG 
CCAGTAAACCCATGAAAGGTTGCTCTATATCACTGGTCTTCAAAGA 
AATGCAAATTATAACTATAATGAAATACAATTGCACAGAATGGCCA 
CAATTAAAAAGACTGATAATACCAAGCATTGGCAAAGATGTGGAGC 
AATAGAAACTCTCATAGATAGCTGGCAGAAATGTAAATGGTACAAA 
CACGTTGGGAAACATTTTGGCATCTTTGATAAAGCTCAGCACACAC 
TTAACATACAACCCAGAAATCCCATTCCAGTCAGGCATGGTGGCTT 
ACGCCTATAATCCCAGTACTTTGGGAGGCTGAGGCAGGCGGATCAC 
TTGAGCTCAGGTGTTCAAGACCAGACTGGGCAACATGGCGAGACAC 
TGTCTCTACTAAAAATACAAAAAAAAAAAAAAAAAAAGCCAGACAT 
GGTGGTAAGCACCTGTGGTCCCAGCTACTAGGGAGGCTGAGGTGGG 
AGAATTGCTTAACCCTGGGGAGTGGAGGTTGCAGTGAGCTGAGATT 
GCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCA 
AAAAAAGAAAAAAAGAAGAAGAAAAGAAGTCCCACTCCTGGATATT 


SEQ ID 

NO. 233 



2345.2049-005 



-180- 





TACCCCCAAAAGAAAAATATGTAATTCCATAAAGACTTGTACAAAG 
ATGTTCATAGCAGCTTTATTCATAGTAATCTCAAAACTTAAATGAC 
CCAAATGTCTGTCAACAGGACAATGGGTAAATAC [A/T] TCATAGT 
CTGTTCATCCAATGGAATATTACTCAGCAGTAAAAAGGAATGTTAT 
AGTTGCATGCAGCAATGTGTATGAAGCTCATAAACCTCATGCTGAG 
TAAATGAAGCCAGACGCAAATGAGTTTACACTGTTTTACTCCATTT 
ACATGAGATTTTAGAAAATACAAACTAATCTATAGTAACAGAAATT 
AGATCTGTGGTTGCCTGGTGTCAAAGCTTGAGAGGCACTCACTGCG 
AAGAAGTGTGAAGGGATGTCTTTTGGTTGTGAAAATGTTCTATATC 
TTGAGTGTGGTGGAGGTTACATGGGTGGATACATTTGTCAACATTC 
ATCAAACAGTACACTTAAAATGGGTGAATTTGTTATAAGTAAATTA 
TGCTCCAATAAATTTGATTTATTTGTTGAAAAACTTGGTGTAAGGG 
GGAAGTGCCTAACCAATAGAAGACACTCAAAAAATGTGTTGAAGGA 
AAAAAATCCTGTGAAATAAAGCAGGTAAGAGAAAATAAGAACTCAA 
TATCATCCAAAATATAGATTACAAATCCTAAATGAGATAATAGGAA 
ATTAATCCCAGTGCTCTGTTTAAAGGCTCATACCTGTAATCCCAAC 
ACTTTGGGAGACTGAGGCAGGAGGATGGGTTGAGCCCAGGAGTTCA 
AGACCAGCCTGGTCAACATAGGGAGAGCCTGTCTCTTCAAAACAAA 
AATTTAAAAATTACCTGGGTGTAGTGGCACGTGCCTGTGCTCCCAG 
CTACTCCAGAGGCTGAGGCAGGAGGATAGCTTGAGCCCAGGAGTTC 
AAGCCTGCCCTGAGCCATAATCACTGCACCACACTCCAGCCTGGGC 
AACAGAACAAGACCCTTCCTCAAAAAAGCAATAAAATAAAATAAAG 
AAATGCACATGACTAACATAGGGTTTATTCCAGGAATGCAGGAATA 
GCCCAGTAGCAGAGAAAGCCTATTAAATAATTTATCACATTAATAT 
ATCAAAAGATCAAACCATTTGATGCTA 




KCP 2336 
55 


TTTACTCCATTTACATGAGATTTTAGAAAATACAAACTAATCTATA 

GTAACAGAAATTAGATCTGTGGTTGCCTGGTGTCAAAGCTTGAGAG 

GCACTCACTGCGAAGAAGTGTGAAGGGATGTCTTTTGGTTGTGAAA 

ATGTTCTATATCTTGAGTGTGGTGGAGGTTACATGGGTGGATACAT 

TTGTCAACATTCATCAAACAGTACACTTAAAATGGGTGAATTTGTT 

ATAAGTAAATTATGCTCCAATAAATTTGATTTATTTGTTGAAAAAC 

TTGGTGTAAGGGGGAAGTGCC TAACCAATAGAAGACACTCAAAAAA 

TGTGTTGAAGGAAAAAAATCCTGTGAAATAAAGCAGGTAAGAGAAA 

ATAAGAACTCAATATCATCCAAAATATAGATTACAAATCCTAAATG 

AGATAATAGGAAATTAATCCCAGTGCTCTGTTTAAAGGCTCATACC 

TGTAATCCCAACACTTTGGGAGACTGAGGCAGGAGGATGGGTTGAG 

CCCAGGAGTTCAAGACCAGCCTGGTCAACATAGGGAGAGCCTGTCT 

CTTCAAAACAAAAATTTAAAAATTACCTGGGTGTAGTGGCACGTGC 

CTGTGCTCCCAGCTACTCCAGAGGCTGAGGCAGGAGGATAGCTTGA 

GCCCAGGAGTTCAAGCCTGCCCTGAGCCATAATCACTGCACCACAC 

TCCAGCCTGGGCAACAGAACAAGACCCTTCCTCAAAAAAGCAATAA 

AATAAAATAAAGAAATGCACATGACTAACATAGGGTTTATTCCAGG 

AATGCAGGAATAGCCCAGTAGCAGAGAAAGCCTATTAAATAATTTA 

TCACATTAATATATCAAAAGATCAAACCATTTGATGCTAAAATCAC 

ATTTGATATAATTTACCATTTATTCATAATAATTTTCAGGATTCAA 

TTAATTAGGAATAAAATACTTCTTCAGCATAATAGAAAATACCCCA 

GCCTGGTACACAGCTTCATACTTTATGGTAACAC [A/G] CGGAGAT 

TCTCACTGAAGAAAAGATGAGGCAAGAAAAGATGATGAAGAAAAGA 

TGAGGCAAGAAAAGATGATGTCTGCACACTGTCAGACATCACCACT 

GTTTAACATTTCCTGAAAGCTCTTCAAACACAGTGAAACAGAAAAG 

GAAATGCGATCTAAATAGGAAAAATTACAACATTCCTTGTTAATGA 

CATGATTTTCTATCTGAGAAAAAAGACAGCAAGAAAATCAACTTAA 

AACAACTAGAACTTTTAAAAAGCTGGCAAAGTGACTGGTAATAAAA 

TACATATGCAAAAAGAAATTGTGTAGCCAATATATCAGTTGTGACT 

AGCTAGAAAATTGTAATACAAATATTCTCATTGTGATCACAATAAA 

ATTTAAAGCACATGGGCATTTTTAAATATCCATAATTTAGATGAAG 

AGAAAGAAAATTTTGATAAGTAGAGAAACATACCATCTTCTGAAAG 


SEQ ID 
NO. 234 



-181- 





GATGTATATTATAAAGATAGCAATATTATAATGACAGCAATTCTTC 
TCTAATTAAATTTATTTTATTTTGAATCAAAATGGAAGTGTTATTT 
GGGAAGGAAATTTGGCACAATTGTTATAAAGTTACATTGGAAGATT 
AATCAGATGAAAATAGCAAAGATAATTTTCAAAAAGAAGAAAAATG 
GTGGGATTTGTTCTACCAGATACTGAAATATATTATAAAGCTGAAA 
CTATTAAAATATTATAATATCAGAGAAGGAACAGGTAGATCAATGG 
AACAAAATAGAAATCCCAGGTACAAATACCATCTTGGTTCATAATA 
AAGGGAGCATATTGAATAGAGAGGTAATGAATCATTAAATGATTCT 
TGGAAAACTGGTTAACTATTTTGGCAATAAGTAAGTAAATATTCTT 
ACTCGGTACCATAAACACAAAATCACTATAGATATGTACAGTTGCT 
TTTTAACTAAAAAAGAACTAAAAATCATATGTGAATATCTGATCAA 
AGAATGGAAAAAGCATAAAATCAAAGT 




KCP_2375 
05 


GCCTGTAGTCCCAGCTACTTGAGAGGCTGAGGCGGGAGGATCACTT 
GAACCCGGGAGGTCGAGGCTGCAGTGACGGGGATTGTGCCACTGCA 
CTCCAGCCTGGGTGACAGAGCAAGAACCTGTCTCAAAAAAAAAAAA 
AAAGAAAAAAGAAAAAAAGAATGAGAAACTCATACAGATTAGAAGA 
GACTAAGGAGACACAACAAATAAATGCAATGTAGAATCATTGAAGG 
GAAAAAAATATTAGTTGAAAAGCTGAGATCCCGCCACTGCACTCCA 
GCCTGGGCCACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAGAAAAGCTGATAAAATTTGAATAAGCCCTGTAG 
TTTAGTTAATAATAGTGAAGCCATGTTAATTTCCTGGGTTTGGTCA 
TTGTGCTCTGGTTATGCAAGTTGTTAACATTAGAGGAGACTGAGTG 
AAAGGTATGCATGAACTCTCTGTACTAATTTTGTAAATTT [C/T] C 
TGTAAGTCTAAAATTATTCATAATATGCAAAAATTAAACAAAAAAT 
AAAATAAAATAAGCACATGGAATGAGACTGTCCCCTGGGTCTCTGT 
AGAAACCAGGTCAAACATCCCAAATGCTCTTTTACCCCCATTCTGA 
GTTGGGCCAGAATGGTCAGAATAATGGTTCCCAATGTACCTTGATA 
AACACGGAAACTCTCAGGACCGAGTCCTAAGGTTCTCTGATTCAAT 
AGGTTTGGAGTGGACTTGAGAACTGATCTTTTTAATAAGGGCCTCA 
GTCTGTGGAACTATTGGCCTCATGTGCCCTGTGGATAATCTTGGCT 
GTTGGTTCATTTTTCTTAACTGAAAACAGTGGCAGAAACTATGGGG 
ATTTTTAAATCTCTAGGCTAGAACATTAACTTTTTAAAAATTCAGA 
ATAGTATTTTATTTGCCTCAAGCCTGTGAATGGGGATCCCACAAAT 
CACCCCCCACTGAAGACAATGCCCATAACAAGGTAACCT 


SEQ ID 

NO. 235 


KCP_154 0 
0 


TTATGCAAGTTGTTAACATTAGAGGAGACTGAGTGAAAGGTATGCA 
TGAACTCTCTGTACTAATTTTGTAAATTTTCTGTAAGTCTAAAATT 
ATTCATAATATGCAAAAATTAAACAAAAAATAAAATAAAATAAGCA 
CATGGAATGAGACTGTCCCCTGGGTCTCTGTAGAAACCAGGTCAAA 
CATCCCAAATGCTCTTTTACCCCCATTCTGAGTTGGGCCAGAATGG 
TCAGAATAATGGTTCCCAATGTACCTTGATAAACACGGAAACTCTC 
AGGACCGAGTCCTAAGGTTCTCTGATTCAATAGGTTTGGAGTGGAC 
TTGAGAACTGATCTTTTTAATAAGGGCCTCAGTCTGTGGAACTATT 
GGCCTCATGTGCCCTGTGGATAATCTTGGCTGTTGGTTCATTTTTC 
TTAACTGAAAACAGTGGCAGAAACTATGGGGATTTTTAAATCTCTA 
GGCTAGAACATTAACTTTTTAAAAATTCAGAATAGTATTTTATTTG 
CCTCAAGCCTGTGAATGGGGATCCCACAAATCACCCCCCACTGAAG 
ACAATGCCCATAACAAGGTAACCTACCCATGAGCTTCTGAGGGATT 
TAGGAATTGTCTACCATCTCCTCTCT7UVGAAGGGCTCCCACAATAT 
ATCCCCTTCTGCTTGCTTCTAACTCCCTATCACCTGCTAAAGAAGG 
ACCTCACCTTTTAATCACTTTCATTGCCAAGGGGCACAAGGAGCCC 
CAAACTCTGTCACCTAGGAAGAGCTTGACCTCATGGTTTCCACACT 
GTGTGCTTTTATGTCCCTGCTCCAGGAGATGATGGACATTGTCAAA 
GCCATCTATGACATGATGGGGAAATACACATATCCTGTGCTCAAAG 
AGGACACTCCAAGGCAGCATGTGGACGTCTTCTTCCAGGTAAGTGC 
ACACACCCTGCACATGAGCTGTAAGCCCAGCCTAGATCAAGTCAAC 
CCACGAGCATCTGAGCAAATGATTTGTGTCCAAC [C/T] CTGTACT 
AAGCATGGTTGGTAACAGAAAAGAATTATAAGATACATTGTCCTCA 


SEQ ID 

NO. 236 



2345.2049-005 



-182- 





AGAAACAGATGATCTCCTTAAGCTGCAAGTGTACATGACAGAAGAG 
AACAAGAAAGTATATTATTAAACGCTAGTGGTATAGTATGAACTCT 
AAATCCATAAAAATTTGGGGATCAGGGTAAACACGAAAGACTTCAT 
TAATTACAACTGTGGAGGTGTTAAGCATTTGTGTCTGGGAAGTAAG 
GGGAAATAAGATTGGAAACTAGGATAGGGCCAGATTATGAGACCTT 
TAAATGGAAGAGTTTGGCCTTGCTCTGGTACAGGATGGGCAGCTAG 
TGCTGATCCTTGACTAAGGGAGTGGTATAATCATTGGGGCATTTTA 
GGAAAAAATTAATCTAGCGGTGGAGTATCAGAGAATATCAAGAGTT 
CACTCTAGTTCAACCTCCCACTTTGCAGATGGGAAAAGAGAGTCCT 
CTCTGGCCTTGTGCAAGTTTGTACAGCAAGTAACAGGCCAGAATCA 
GAACCTCTTTTGCCCAGTGTTCTGCCAGATGGACAGGGTAGCAGGG 
AGTCTACAGAAGAAGCAGAATAAGCCAGCAGTGAGGTGATGAGTGT 
CCAGAGCAAGTCTTTTGATTTAAGGAAGCTCATGGGGCTCAAAGTG 
TTGTAATCAGGACCTAATTGGAGTTGTCTGGCCAGTGAAAGACAAC 
TCTCATTCTCAGGGCAAAGTTGGTTAATGAAATGAATGAAATGAGC 
TCCAGCTCGTTACTCTGAGCTCCAGCAAGAAAGCAGGGGAGTAAGC 
TTTGGAATGGAGATCACCAGATTCTGTAAAGTGCTTTCTGTTATGT 
CTTTCAGAAAATGGACAAAAATAAAGATGGCATCGTAACTTTAGAT 
GAATTTCTTGAATCATGTCAGGAGGTAAGGAGAGATCTCAGGGCAC 
AATAACTCTACATCTGGGAAAGGAAACCTGGGGCCTGGGGACCTGC 
AGAAGGAAGGTGATGAGAAAC CTGC AC 




KCP 2385 
91 


TCTAACTCCCTATCACCTGCTAAAGAAGGACCTCACCTTTTAATCA 
CTTTCATTGCCAAGGGGCACAAGGAGCCCCAAACTCTGTCACCTAG 
GAAGAGCTTGACCTCATGGTTTCCACACTGTGTGCTTTTATGTCCC 
TGCTCCAGGAGATGATGGACATTGTCAAAGCCATCTATGACATGAT 
GGGGAAATACACATATCCTGTGCTCAAAGAGGACACTCCAAGGCAG 
CATGTGGACGTCTTCTTCCAGGTAAGTGCACACACCCTGCACATGA 
GCTGTAAGCCCAGCCTAGATCAAGTCAACCCACGAGCATCTGAGCA 
AATGATTTGTGTCCAACCCTGTACTAAGCATGGTTGGTAACAGAAA 
AGAATTATAAGATACATTGTCCTCAAGAAACAGATGATCTCCTTAA 
GCTGCAAGTGTACATGACAGAAGAGAACAAGAAAGTATATTATTAA 
ACGCTAGTGGTATAGTATGAACTCTAAATCCATAAAAATT [C/T] G 
GGGATCAGGGTAAACACGAAAGACTTCATTAATTACAACTGTGGAG 
GTGTTAAGCATTTGTGTCTGGGAAGTAAGGGGAAATAAGATTGGAA 
ACTAGGATAGGGCCAGATTATGAGACCTTTAAATGGAAGAGTTTGG 
CCTTGCTCTGGTACAGGATGGGCAGCTAGTGCTGATCCTTGACTAA 
GGGAGTGGTATAATCATTGGGGCATTTTAGGAAAAAATTAATCTAG 
CGGTGGAGTATCAGAGAATATCAAGAGTTCACTCTAGTTCAACCTC 
CCACTTTGCAGATGGGAAAAGAGAGTCCTCTCTGGCCTTGTGCAAG 
TTTGTACAGCAAGTAACAGGCCAGAATCAGAACCTCTTTTGCCCAG 
TGTTCTGCCAGATGGACAGGGTAGCAGGGAGTCTACAGAAGAAGCA 
GAATAAGCCAGCAGTGAGGTGATGAGTGTCCAGAGCAAGTCTTTTG 
ATTTAAGGAAGCTCATGGGGCTCAAAGTGTTGTAATCAG 


SEQ ID 

NO. 237 | 


KCP 1615 
2 


GGAAGAGCTTGACCTCATGGTTTCCACACTGTGTGCTTTTATGTCC 
CTGCTCCAGGAGATGATGGACATTGTCAAAGCCATCTATGACATGA 
TGGGGAAATACACATATCCTGTGCTCAAAGAGGACACTCCAAGGCA 
GCATGTGGACGTCTTCTTCCAGGTAAGTGCACACACCCTGCACATG 
AGCTGTAAGCCCAGCCTAGATCAAGTCAACCCACGAGCATCTGAGC 
AAATGATTTGTGTCCAACCCTGTACTAAGCATGGTTGGTAACAGAA 
AAGAATTATAAGATACATTGTCCTCAAGAAACAGATGATCTCCTTA 
AGCTGCAAGTGTACATGACAGAAGAGAACAAGAAAGTATATTATTA 
AACGCTAGTGGTATAGTATGAACTCTAAATCCATAAAAATTTGGGG 
ATCAGGGTAAACACGAAAGACTTCATTAATTACAACTGTGGAGGTG 
TTAAGCATTTGTGTCTGGGAAGTAAGGGGAAATAAGATTGGAAACT 
AGGATAGGGCCAGATTATGAGACCTTTAAATGGAAGAGTTTGGCCT 
TGCTCTGGTACAGGATGGGCAGCTAGTGCTGATCCTTGACTAAGGG 
AGTGGTATAATCATTGGGGCATTTTAGGAAAAAATTAATCTAGCGG 


SEQ ID 

NO. 238 



-183- 





TGGAGTATCAGAGAATATCAAGAGTTCACTCTAGTTCAACCTCCCA 
CTTTGCAGATGGGAAAAGAGAGTCCTCTCTGGCCTTGTGCAAGTTT 
GTACAGCAAGTAACAGGCCAGAATCAGAACCTCTTTTGCCCAGTGT 
TCTGCCAGATGGACAGGGTAGCAGGGAGTCTACAGAAGAAGCAGAA 
TAAGCCAGCAGTGAGGTGATGAGTGTCCAGAGCAAGTCTTTTGATT 
TAAGGAAGCTCATGGGGCTCAAAGTGTTGTAATCAGGACCTAATTG 
GAGTTGTCTGGCCAGTGAAAGACAACTCTCATTCTCAGGGCAAAGT 
TGGTTAATGAAATGAATGAAATGAGCTCCAGCTC [A/G] TTACTCT 
GAGCTCCAGCAAGAAAGCAGGGGAGTAAGCTTTGGAATGGAGATCA 
CCAGATTCTGTAAAGTGCTTTCTGTTATGTCTTTCAGAAAATGGAC 
AAAAATAAAGATGGCATCGTAACTTTAGATGAATTTCTTGAATCAT 
GTCAGGAGGTAAGGAGAGATCTCAGGGCACAATAACTCTACATCTG 
GGAAAGGAAACCTGGGGCCTGGGGACCTGCAGAAGGAAGGTGATGA 
GAAACCTGCACATACCTGCAACCCCTCCCATCAGAGCCAACAACAC 
CAGCAACAACTGTGAAGTCCACAGTTCCACTCCTCAACCTGACCTG 
CAGTTGGTCTTGGCTAAGCACAAGACTGAACAGAGAGCCTAAGTAG 
GGGTCTGGGGGCATGTGAAAACTCAGAGGGGGTCTCTGTGAAAATA 
GACTTCCCGAGAGGGCAACACCATTATTTTTTAGCCTGCCTCTGGC 
TTGATGACCCATTTCCCAGACTACAAGGAAGCAGCTGGGGGGAAAA 
AAACCTACAATTGTGTGATTCTCAAACCACAGTGTGCATAAAAATT 
GCCTGGAATGATTCTGAAAATGCATATTTCCAGGCCTCAATCCCAG 
AGACTCTAGATCTGGGTCACTTTAACACAAATGTCCTGGACCAATG 
CTTCTAACACTTTAATGTGTGAAACAATATCCTTGATGATTTTGTT 
AAAATGCAGATTCTAATTCCATAGGTCTGGGGTAGGGCCTGAGATG 
TTACTTTTCTCACATTCTCCCCAGTCACACTGGTGATGCTGATCCT 
GGGAACACAACTTTCATTAAGTCTAACCAATAGACCAGCCCCAGAG 
TCCACCAGAGACTGAACTGGAAATAATTGCTTCATCTACTTTTGAG 
AAATCCATTTGTACCCCCACATTATTTTAGAAATGTTCAGAGTTAC 
TCTGAGCTCCAGCCAAGAAGAATAGCAAATGTAAGAAAGCCGGGGA 
GAAGTTCCTAGCAGATACTGAGCCCCC 




KCP 1806 
9 


TTGAAAGAGAGCGCTTTGGGGGGTTTTCTTACTGTATGTCTCTATT 
GCATGTTCTGTATTTTACATTTTTCTATTATTTCTTCTCTGAGGTA 
TAGTATTGAATGTAGAAAAATCCTCAAATGTTCGGTATTAAGCAAT 
ACACTTCTAATTCATGGTTCAGAGAAGAAAATATCTCGAATAAAAA 
TAAAATAAAAATATGACTTATCAAAATTTGTAGGATCTAAAGCAGT 
ATTCCAGGAATGCAAGGTTGGTTTAACATTCAATAATTGGTCAGTG 
TAATTAATCACATTAATAGAATAAAAAGAGAAAAAATATAATCATT 
TCAGTGGATGTAATTGTTCAGAGCTTCTTAAAAGAAGCAACTCACT 
ATTTTACTAGATGATTTGTTTCTTCTGAATTCCTCTTTAAGGCTAC 
AGGTGGTGCTTCTTACTTTGAACTGATCACTTTCTAGGTCCCCACC 
CTTACTTCTTGTTTTTCATACCCTTGTAGAGTTTTCTCCA [C/T] A 
TAGGAAACCCATGCTTGACATTTGCTCACCAGAGTTACAGAGCTCT 
CAGGGAGGAGACTCAGAGTTCTAACCCTCTTGCCCTCCTTTTTTCC 
CAGGACGACAACATCATGAGGTCTCTCCAGCTGTTTCAAAATGTCA 
TGTAACTGGTGACACTCAGCCATTCAGCTCTCAGAGACATTGTACT 
AAACAACCACCTTAACACCCTGATCTGCCCTTGTTCTGATTTTACA 
CACCAACTCTTGGGACAGAAACACCTTTTACACTTTGGAAGAATTC 
TCTGCTGAAGACTTTCTATGGAACCCAGCATCATGTGGCTCAGTCT 
CTGATTGCCAACTCTTCCTCTTTCTTCTTCTTGAGAGAGACAAGAT 
GAAATTTGAGTTTGTTTTGGAAGCATGCTCATCTCCTCACACTGCT 
GCCCTATGGAAGGTCCCTCTGCTTAAGCTTAAACAGTAGTGCACAA 
AATATGCTGCTTACGTGCCCCCAGCCCACTGCCTCCAAG 


SEQID 
NO. 239 


KCP 2415 
27 


ACTTTGAACTGATCACTTTCTAGGTCCCCACCCTTACTTCTTGTTT 
TTCATACCCTTGTAGAGTTTTCTCCATATAGGAAACCCATGCTTGA 
CATTTGCTCACCAGAGTTACAGAGCTCTCAGGGAGGAGACTCAGAG 
TTCTAACCCTCTTGCCCTCCTTTTTTCCCAGGACGACAACATCATG 
AGGTCTCTCCAGCTGTTTCAAAATGTCATGTAACTGGTGACACTCA 


SEQID 
NO. 240 



rJ 

2345.2049-005 



-184- 





GCCATTCAGCTCTCAGAGACATTGTACTAAACAACCACCTTAACAC 
CCTGATCTGCCCTTGTTCTGATTTTACACACCAACTCTTGGGACAG 
AAACACCTTTTACACTTTGGAAGAATTCTCTGCTGAAGACTTTCTA 
TGGAACCCAGCATCATGTGGCTCAGTCTCTGATTGCCAACTCTTCC 
TCTTTCTTCTTCTTGAGAGAGACAAGATGAAATTTGAGTTTGTTTT 
GGAAGCATGCTCATCTCCTCACACTGCTGCCCTATGGAAG [G/T] T 
CCCTCTGCTTAAGCTTAAACAGTAGTGCACAAAATATGCTGCTTAC 
GTGCCCCCAGCCCACTGCCTCCAAGTCAGGCAGACCTTGGTGAATC 
TGGAAGCAAGAGGACCTGAGCCAGATGCACACCATCTCTGATGGCC 
TCCCAAACCAATGTGCCTGTTTCTCTTCCTTTGGTGGGAAGAATGA 
GAGTTATCCAGAACAATTAGGATCTGTCATGACCAGATTGGGAGAG 
CCAGCACCTAACATATGTGGGATAGGACTGAATTATTAAGCATGAT 
ATTGTCTGATGACCCAAACTGCCCATGTCATTTGTTTCCAGAAACG 
AGGACCAATAATTCTCTCACACTGGCATTTGTGCTGGTAGTACAAG 
TCCTTTAATATGTCCAGGAAGGGAGCCATTGCCCAGTGGTCCATAT 
CTCCACCACATCCCCTGCTTGAGCCCAGCGCTGCATGTCCCTCCCA 
AGAAGTCCAGAATGCCTGCAAATTGCTGTAATTTTATAC 




KCP 2418 
04 


CTGATCTGCCCTTGTTCTGATTTTACACACCAACTCTTGGGACAGA 
AACACCTTTTACACTTTGGAAGAATTCTCTGCTGAAGACTTTCTAT 
GGAACCCAGCATCATGTGGCTCAGTCTCTGATTGCCAACTCTTCCT 
CTTTCTTCTTCTTGAGAGAGACAAGATGAAATTTGAGTTTGTTTTG 
GAAGCATGCTCATCTCCTCACACTGCTGCCCTATGGAAGGTCCCTC 
TGCTTAAGCTTAAACAGTAGTGCACAAAATATGCTGCTTACGTGCC 
CCCAGCCCACTGCCTCCAAGTCAGGCAGACCTTGGTGAATCTGGAA 
GCAAGAGGACCTGAGCCAGATGCACACCATCTCTGATGGCCTCCCA 
AACCAATGTGCCTGTTTCTCTTCCTTTGGTGGGAAGAATGAGAGTT 
ATCCAGAACAATTAGGATCTGTCATGACCAGATTGGGAGAGCCAGC 
ACCTAACATATGTGGGATAGGACTGAATTATTAAGCATGA [C/T] A 
TTGTCTGATGACCCAAACTGCCCATGTCATTTGTTTCCAGAAACGA 
GGACCAATAATTCTCTCACACTGGCATTTGTGCTGGTAGTACAAGT 
CCTTTAATATGTCCAGGAAGGGAGCCATTGCCCAGTGGTCCATATC 
TCCACCACATCCCCTGCTTGAGCCCAGCGCTGCATGTCCCTCCCAA 
GAAGTCCAGAATGCCTGCAAATTGCTGTAATTTTATACCATGTTCT 
AACCAATAAACAGAACTATTTCTTACACTCTCAATCACTTCTTCAT 
GACTCCGTTAGGTAAGAGAGGTAAGCTGTGAAAAGGGAAGGCTAGT 
CCATTCATTTGACACCCAATTATTAGTGCAGTTGTCCCTCCATATG 
TGTGAAGGATCAGTCCCAGGACTCTCCATACCAAAATCTGCAGATA 
CTCAAGTCCCACAGCTAGCCCTGAGGGACTCGTGTTTTCAGAAAAT 
TTGGCCTCCATATATGCAGGTTTCACATCCTATAAATAC 


SEQ ID 
NO. 241 


KCP_1324 


CCCAAGCTCAACCATTCCAATGCCATCTCCTCTGGTTCCAGATAAG 
ATTGAAGATGAGCTGGAGATGACCATGGTTTGCCATCGGCCCGAGG 
GACTGGAGCAGCTCGAGGCCCAGACCAACTTCACCAAGAGGGAGCT 
GCAGGTCCTTTATCGAGGCTTCAAAAATGTAAGACCCGTGCACGCT 
CTGAAGGCCTGGGGGG 


SEQ ID 
NO. 242 


KCP_1520 
4 


TTGTCTACCATCTCCTCTCTAAGAAGGGCTCCCACAATATATCCCC 
TTCTGCTTGCTTCTAACTCCCTATCACCTGCTAAAGAAGGACCTCA 
CCTTTTAATCACTTTCATTGCCAAGGGGCACAAGGAGCCCCAAACT 
CTGTCACCTAGGAAGAGCTTGACCTCATGGTTTCCACACTGTGTGC 
TTTTATGTCCCTGCTC 


SEQ ID 
NO. 243 


KCP_4957 


ACCCTCAATACAGACTGTTCCTACAGTCCACGTCCTCAGCCACTAG 
ACCATACGGCCACTGGGATGATAGACAGACCACTGCAGCCATGGAT 
AAGGCAAAAACAGGGCTGGCTGTGTTGATCTGTGTCTCTCAGAGCT 
CCATTCTTCCTCAAGGGGGCACCTTGCAAAAAAAAACAAAAAAATG 
GGGCAGGGTAGGGAAC 


SEQ ID 

NO. 244 


KCP_5011 


GCCACTGGGATGATAGACAGACCACTGCAGCCATGGATAAGGCAAA 
AACAGGGCTGGCTGTGTTGATCTGTGTCTCTCAGAGCTCCATTCTT 
CCTCAAGGGGGCACCTTGCAAAAAAAAACAAAAAAATGGGGCAGGG 


SEQ ID 
NO. 245 



-185- 





TAGGGAACTGAAGGCAGGAGCTCTTCACAGAGCATAGCCACATCCT 
CCAGGCAGACAAGAGG 




KCP_5051 


GGCAAAAACAGGGCTGGCTGTGTTGATCTGTGTCTCTCAGAGCTCC 
ATTCTTCCTCAAGGGGGCACCTTGCAAAAAAAAACAAAAAAATGGG 
GCAGGGTAGGGAACTGAAGGCAGGAGCTCTTCACAGAGCATAGCCA 
CATCCTCCAGGCAGACAAGAGGACGCAGGAGGCACCATTCTGTGAG 
AGTATCACAGTCTGAC [C/T] CAAAGACACAGCTTCACACTGTCTG 
ATGGCTTGATGGTTAATGTCACTCTGCCTTTTCCCCTTCTCAGGAC 
TTTGTAACCGCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAGA 
AACTAAGGTGGACATTTAATTTGTATGACATCAACAAGGACGGATA 
C AT AAAC AAAGAGGTAAGTGAGCTGGGGC CAGGGGTGT 


SEQ ID 
NO. 246 


KCP_5202 


GACAAGAGGACGCAGGAGGCACCATTCTGTGAGAGTATCACAGTCT 
GACCCAAAGACACAGCTTCACACTGTCTGATGGCTTGATGGTTAAT 
GTCACTCTGCCTTTTCCCCTTCTCAGGACTTTGTAACCGCTCTGTC 
GATTTTATTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTT 
AATTTGTATGACATCA [A/C] CAAGGACGGATACATAAACAAAGAG 
GTAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGCTCCAGTGAAGGTA 
ACTAACCCAACAGAAAACAGCCCCAGGCATGAGGATAGCACTGTCT 
GAATGAGGCAGGCTCTGCTTTGGGGCTAACAGAGCTGGTCCCTGGC 
AAAATAAAGAAGGCCTCCCTCATTGCCCTACCCTGCCC 


SEQ ID 
NO. 247 


KCP ela 
249924 


CCACCAGGGTCCCTTCCAACTCACGGAGCCTATGGTACTGA 

ATGGCAGCCAGGTTTTTTATGGAGCAATAGCTGGACTTCAC 

ATTTGCATAATGCCTTGCAGTTTCACTGTTAAGAGTACTGC 

ATTGTATTCTAATTATATGAATCTCGGTCATTCCTTTATGAC 

ATTTCTGAGGAATACTATCTCAATCAAGAAAAGCCCTAATT 

GCACTCCTCTCCTATCCCGGTGAGAGAGCACAGACTCGTGC 

CTGCTCCGCAGGGGTGGAGGCTGGAATTCAGTAGTCTGAGT 

CGGGGATGCCTGGAGCAGGAGGTGGTCAGGGGCATTGTCC 

TTTCCAAGTCAGGAAGGCAGACAGCACCTGCTGTTGGTGCC 

AAGGTTACTGGACAGGCTGCGAGGGCTCTGTCTGTCTGTCC 

GATGTTCACAGGCCAGCTCCCCGGAGGCTCAGCACTCAGCC 

CAGCTTCTCCGAGATGCAAACCAGGCCACTCTGAGGCTGCC 

TACAAACTTTCTGCTGAGTGCCGACAGCTGCTTCCTGCTCTG 

CGGGGAGTTCTTCCAGATCCTGATCAAGGCACAGAGAATTG 

ATCTATCAGATTAACCAGGAAGGAAAGAGTGGGAGAGCGA 

GTGTGGGAGGCTGTGGGGCTGAGTGTTTTCTGCGTAGCAGT 

CCCCTCCCITCTGACTTGAGTATTAATTGCTACATTACCGCT 

GCCATGTAAGAAAGACAGTCAGCAAAGCCTGGGAGAGCTC 

CAGCrCCTCCCTCCCTGCTCTGCTCAACTTCACTCTCCTCCT 

CGGTTCCCTTGGAGTACCTTGTGCCCCGGCAGTGCTGTCCC 

GGCCCTGGCATCCTGAGGTCCTCCCGTGGTGAGGACTTAAG 

TGGACA[C/G]CAGGAGTGGGTGGAGAGAGGGAGGGAGAGT 

TTGCCCTGCAGGCTCTCTGGATGCAGAAGCCAGACTCGCTG 

CAGAGGCAGCTGTGCTGTTCCCGGAGCCTGGCTTCAGGGGT 

GCATCCGTCACTCAGGGTTCATTCACCCAGGCAGGCTCCAA 

GTTCCTGGGGTGCACAAGGTGGGCACTGTCCCTTCTGGGTG 

CTGACAGCAGAGCCTGGCTCCCCTCCGCCACCATGAGCGGC 

TGCTCCAAAAGATGCAAGCTTGGGTTCGTGAAATTTGCCCA 

GACCATCTTTAAGCTCATCACTGGGACCCTCAGCAAAGGTA 

TGGAAACTGGCCTTGACCCTTGCTTTCTGTCTTGATATGGCC 

TGGCTGGTCGCATTGCCTCGGTGTGGTGAGCGTGACCATTC 

TGGTGCACCCAGGTCTTGGAAAAAGCTGGGGAAATTGGTG 

GCTGGGATTCGAGGTTGCTGACAACCTGCGTCCTGGCTTTG 

AGTAGGCGGGCACCCAGCCAGGGAACTCAGCTGGCTGTAA 

TTGCCTGGAACTTTGGAAATGGAGTTGGTGGTGTGTGGCTG 

ATACGTTATGGGCGGGCAGAGGGATAGAACCCTTTCCAGA 


SEQ ID 
NO. 248 



2345.2049-005 



-186- 





GCATTGGAAGTGGCTTAGCGTGACTGGAGTTTCAAGAAGTT 

ATCCATGGAAGGTTGTATTTTGTTGATAAAAGAGAGATTTG 

ATGCAGTGGGTTGTGAGTAATTCTGCAGAACAGAGACGCTT 

GAGGGGGCCAGTGGGAGGTGGTGATGGGCCGGCATCTGCT 

TTGCCCTGGTGGCTTCAGAAACCGGATCAGCTCTGCACCTC 

AAGTGCCAAGAGCCTCCTCTCATAGGGTTCCAGCGTCTCGT 

GCTTCTGGGGCTTCATTCATCGTTCTGCTTTCTTGGATCCCT 

GTCCCTCCACATTTCATGCCTA 




KCP ela 
250027 


CAAGGCACAGAGAATTGATCTATCAGATTAACCAGGAAGGAAAGAG 
TGGGAGAGCGAGTGTGGGAGGCTGTGGGGCTGAGTGTTTTCTGCGT 
AGCAGTCCCCTCCCTTCTGACTTGAGTATTAATTGCTACATTACCG 
CTGCCATGTAAGAAAGACAGTCAGCAAAGCCTGGGAGAGCTCCAGC 
TCCTCCCTCCCTGCTCTGCTCAACTTCACTCTCCTCCTCGGTTCCC 
TTGGAGTACCTTGTGCCCCGGCAGTGCTGTCCCGGCCCTGGCATCC 
TGAGGTCCTCCCGTGGTGAGGACTTAAGTGGACAGCAGGAGTGGGT 
GGAGAGAGGGAGGGAGAGTTTGCCCTGCAGGCTCTCTGGATGCAGA 
AGCCAGACTCGCTGCAGAGGCAGCTGTGCTGTTCCCGGAGCCTGG [ 
C/T] TTCAGGGGTGCATCCGTCACTCAGGGTTCATTCACCCAGGCA 
GGCTCCAAGTTCCTGGGGTGCACAAGGTGGGCACTGTCCCTTCTGG 
GTGCTGACAGCAGAGCCTGGCTCCCCTCCGCCACCATGAGCGGCTG 
CTCCAAAAGATGCAAGCTTGGGTTCGTGAAATTTGCCCAGACCATC 
TTTAAGCTCATCACTGGGACCCTCAGCAAAGGTATGGAAACTGGCC 
TTGACCCTTGCTTTCTGTCTTGATATGGCCTGGCTGGTCGCATTGC 
CTCGGTGTGGTGAGCGTGACCATTCTGGTGCACCCAGGTCTTGGAA 
AAAGCTGGGGAAATTGGTGGCTGGGATTCGAGGTTGCTGACAACCT 
GCGTCCTGGCTTTGAGTAGGCGGGCACCCAGCCAGGGAACTCAGCT 
GGCTGTAA 


SEQ ID 
NO. 249 


KCP_ela_ 
250049 


ACAGAGAATTGATCTATCAGATTAACCAGGAAGGAAAGAGTGGGAG 
AGCGAGTGTGGGAGGCTGTGGGGCTGAGTGTTTTCTGCGTAGCAGT 
CCCCTCCCTTCTGACTTGAGTATTAATTGCTACATTACCGCTGCCA 
TGTAAGAAAGACAGTCAGCAAAGCCTGGGAGAGCTCCAGCTCCTCC 
CTCCCTGCTCTGCTCAACTTCACTCTCCTCCTCGGTTCCCTTGGAG 
TACCTTGTGCCCCGGCAGTGCTGTCCCGGCCCTGGCATCCTGAGGT 
CCTCCCGTGGTGAGGACTTAAGTGGACAGCAGGAGTGGGTGGAGAG 
AGGGAGGGAGAGTTTGCCCTGCAGGCTCTCTGGATGCAGAAGCCAG 
ACTCGCTGCAGAGGCAGCTGTGCTGTTCCCGGAGCCTGGCTTCAGG 
GGTGCATCCGTCACT [A/C] AGGGTTCATTCACCCAGGCAGGCTCC 
AAGTTCCTGGGGTGCACAAGGTGGGCACTGTCCCTTCTGGGTGCTG 
ACAGCAGAGCCTGGCTCCCCTCCGCCACCATGAGCGGCTGCTCCAA 
AAGATGCAAGCTTGGGTTCGTGAAATTTGCCCAGACCATCTTTAAG 
CTCATCACTGGGACCCTCAGCAAAGGTATGGAAACTGGCCTTGACC 
CTTGCTTTCTGTCTTGATATGGCCTGGCTGGTCGCATTGCCTCGGT 
GTGGTGAGCGTGACCATTCTGGTGCACCCAGGTCTTGGAAAAAGCT 
GGGGAAATTGGTGGCTGGGATTCGAGGTTGCTGACAACCTGCGTCC 
TGGCTTTGAGTAGGCGGGCACCCAGCCAGGGAACTCAGCTGGCTGT 
AATTGCCTGGAACTTTGGAAATGGAGTTGGTG 


SEQ ID 
NO. 250 


KCP UTR1 
_382206 


TGGCCCACCTTCAGGGTCATGAGGATTCATAAACCCTATTC 

TGCGAAGTGCCTCCAGGAATCATCAAGGGAGCTAGGGCAG 

CTCTGAGTCTCCACCAGGCCCACCCTCCGCCTCTCAGGGCT 

GAGCTTCACTTCCCTTCCCAAAGGGGCCAGGGAGAGGGGCT 

GCTGATGACATGATCTCAGAGGAAGGCCAAGGCCTCCAGG 

CTGCCTCTGGGCCTGGCACAGGAAGGAGGAGGAGAAAATA 

GGGAGCCCAAGGAAAGATCAACCCAGCCCAGCCCAAGGAC 

CCCCAGCCCCAGCCCCAGCCCCAGCTGGGCTCAAACTAATT 

GAAAACAGACTGGAAAAGGCTGCrTTTGCCCTTCCTCTAGA 

CTCAGCATCATCAAGACTGGAGGGACAGAGCATTTGAATC 


SEQ ID 
NO. 251 



-187- 





ATCAGACGCTGGGCCAGA[C/T]GTCACCCCACGCGTTTTCTC 
ATTTTATCGTCCTAAGAAGCCCAGAAGGTGCGTAAAATGGC 
CTGTCCCAAACAGATGAGGACATTACCTTTCTCCTCTTCCTC 
CTCCTCCTTCTTCTTCTTCTTCTT^ 

lTll Tl CCCCCAGATGTTGCATTTCAGAGAGGCTGAGCGTGT 

TGACTAAGGTCACACAGCTACAAACATCAGGGACCTGCGA 

AAAAGCTCTGTTCCCTGGTGACAGGTGTTCTGTGATCCTAA 

CACAGCCGGAGGTGGGGACAACGTCCTTGCAGTAACAAAG 

GCCCTGTTGCTCAACTCAGTGGACATCAGGCCCTGTTTTCAT 

TCATTAGCAGGTCAGGGATTCCAGTGTCACCTGTGCCATGT 

ATTCCAGCTGATCTACCTGCAAGCCTCTACTCCCCATTTTCC 

CAGCAGCAGCCGCAGACACCACCCAACTGG 




KCPJJTR1 
_382272 


GGGTCATGAGGATTCATAAACCCTATTCTGCGAAGTGCCTC 

CAGGAATCATCAAGGGAGCTAGGGCAGCTCTGAGTCTCCA 

CCAGGCCCACCCTCCGCCTCTCAGGGCTGAGCTTCACTTCC 

CTTCCCAAAGGGGCCAGGGAGAGGGGCTGCTGATGACATG 

ATCTCAGAGGAAGGCCAAGGCCTCCAGGCTGCCTCTGGGCC 

TGGCACAGGAAGGAGGAGGAGAAAATAGGGAGCCCAAGG 

AAAGATCAACCCAGCCCAGCCCAAGGACCCCCAGCCCCAG 

CCCGAGCCCCAGCTGGGCTCAAACTAATTGAAAACAGACTG 

GAAAAGGCTGCTTTTGCCCTTCCTCTAGACTCAGCATCATC 

AAGACTGGAGGGACAGAGCATTTGAATCATCAGACGCTGG 

GCCAGACGTCACCCCACGCGTTTTCTCATTTTATCGTCCTAA 

GAAGCCCAGAAGGTGCGTAAAATGGCCTGT[A/C]CCAAACA 

GATGAGGACATTACCTTTCTCCTCTTCCTCCTCCTCCTTCTT 

CTTCTTCTTCTTTTTGCTTCATTTTTCTTTCAT^ 

GATGTTGCATTTCAGAGAGGCTGAGCGTGTTGACTAAGGTC 

ACACAGCTACAAACATCAGGGACCTGCGAAAAAGCTCTGT 

TCCCTGGTGACAGGTGTTCTGTGATCCTAACACAGCCGGAG 

GTGGGGACAACGTCCTTGCAGTAACAAAGGCCCTGTTGCTC 

AACTCAGTGGACATCAGGCCCTGTTTTCATTCATTAGCAGG 

TCAGGGATTCCAGTGTCACCTGTGCCATGTATTCCAGCTGA 

TCTACCTGCAAGCCTCTACTCCCCATTTTCCCAGCAGCAGCC 

GCAGACACCACCCAACTGGCAGAAATTTCAAACAAGGGGT 

TCTGCCTTGCACTCCGGTGCAAGGGTTGGGCACGTGGACTC 

ACAT 


SEQ ID 
NO. 252 


KCP_3UTR 
2_395068 


CACAAAACAAATCCGGGACTTTAAGCCTGATCTGCTTGACCTGAAA 
CTCATATCTACTTCCCTGCCCTCTGAAGATCTATATGTCCTATGTC 
ATCACTTCACTGTTCACACAAGGTGATACCTGGCTTCTCCAAGCAC 
CTGCTACCCTGAACTTACTGCACCACTCTTTCCTTCCTAGCCTGAA 
TGCAATTTGCAATGAGGAGATGATTTGATTTTCTTCAGCCCTAGAC 
CTCCAGCTTCCTGAGAGCAGGTACTCTTGCCTCTTCTTGCTCATTA 
TTGATC CATATATTTAGAATAGCGC CTGGCAGGTAGATGGTGCTTA 
ATAAATATTCATTGAATAAATGAATGAATGAATGATC CAATGAGC C 
CCAAAGCAAATAACAATAAAGGACATTTGCAGAGTGCTCTACAGAG 
AGACAAGTGCTTTCCCTTT [A/G] CTTTATCTTACCCCATTCTCAC 
AACAATCCCCTGACATGATTGGGTTCATGTTTCACAGATGAGGAGG 
CTAACGGCCAGGTGTACATACCAGGGGACATGGGACTGGGTTCATA 
TGAGCTCAGGGGTAAATGATGACACCCTTTCCCCTGCCCTGAAGGA 
TCTCAGTTTGAGTATTTGTAGCACACTTAGGATGTTCTGGGCCAGG 
CTGAGTGGCGGTGGATGGGGGCGGTGGAGGTGGGGTATGCAAAGCA 
GGAAACTCGGCCTTTGCTTTCTAAAAGCTCCCAGTCTATTTGAGGC 
CAGACTTATGCATGCAGAACATTTGGGAAATGGTACAAGACAGCAG 
CAAGCATAGTGCTGAATTGCACATAATCAGGTGCCAACTGCATTCC 
CTTCCTTAACTAATCT 


SEQ ID 
NO. 253 


KCP 3UTR 


AACTTTCTCCTCAGCAAAGAGCTCTCCTCTGTTCCCTGAATCCTGG 


SEQ ID 



-188- 



3_398480 


ATATCCCACTGGGTCCTCTAGTGACCCCAAGCTTCAGCCTCGCATG 
CCCTCTTCTCGAACAGAGAAGGCAGGAGGGAAGCAGGGACCAGCCC 
CTGCTCCATCTTCCAGGATTCCAGGCCTCCCTGGCCTGGACAAGCC 
CTGAGCTGGCAGTTAGGAGAGCAGAGGTTGTGAATCTGGTGGGACC 
CCCAGCAGGTCTTTCTGGCTCAGTGCCCTCATCTGTGAGCAGGGGT 
TCCCCAGGAGACCACGACAGAGGCCTGGAACCCAAGTTCTAATCCC 
ACATCCTGGCTGGGCAACTTCAGGCAAATTTCTAACACAAGGTAAG 
CCTCAATTTCTCTCTGGGGTAATGATCAGGCACCTGCTTAATTCAC 
AGGGGTTTGGTGGGCATCA TC/Tl GTGGACAATGT(^CJTTfi^A^Artr , 

AGTGGGCAATGCAAAGGAAAGGAAGTATGTTAGTAAGTGCCCCTCC 
CCTGTTGCACAAAACAGGACACATGCTGGGATTGCAGAAAAGCAAT 
AAATGCTGCACAGGTGAAGAAAACTATTCAAGGACCCTGGCCAAGT 
CACAGGCTACCTGTGGCCCTGAGGGGACAGCTCATGGGTTGGCATT 
AGGGGAAGCAGCTCTCAAGGGGCCTGTATCCTGGGGATTCAACTCT 
GTGCCTATGTGGCATTGAGCCTGTGTGAATGTGGTGACTGTCATGC 
TGTTTTGCTGTGTGTGCGTCTGCATGCCTGTGTGTTTGTGTGTCTC 
TCCACCTTCGTGGGGGGCAACTGTAGGTGTATTATGAGCCTTGGGT 
CTGTCTGTGTGTACAATAGCAATGTCTGTGCGGACTTAAGGACCTG 
CGCCCATATGTTTGTGGGACTTTC 


NO. 254 


KCP 3UTR 
3_398605 


CAGAGAAGGCAGGAGGGAAGCAGGGACCAGCCCCTGCTCC 

ATCTTCCAGGATTCCAGGCCTCCCTGGCCTGGACAAGCCCT 

GAGCTGGCAGTTAGGAGAGCAGAGGTTGTGAATCTGGTGG 

GACCCCCAGCAGGTCTTTCTGGCTCAGTGCCCTCATCTGTG 

AGCAGGGGTTCCCCAGGAGACCACGACAGAGGCCTGGAAC 

CCAAGTTCTAATCCCACATCCTGGCTGGGCAACTTCAGGCA 

AATTTCTAACACAAGGTAAGCCTCAATTTCTCTCTGGGGTA 

ATGATCAGGCACCTGCTTAATTCACAGGGGTTTGGTGGGCA 

TCACGTGGACAATGTGGTTGCACAGCAGTGGGCAATGCAA 

AGGAAAGGAAGTATGTTAGTAAGTGCCCCTCCCCTGTTGCA 

CAAAACAGGACACATGCTGGGATTGCAGAAAAGCAATAAA 

TGCTGCA[C/T]AGGTGAAGAAAACTATTCAAGGACCCTGGC 

CAAGTCACAGGCTACCTGTGGCCCTGAGGGGACAGCTCATG 

GGTTGGCATTAGGGGAAGCAGCTCTCAAGGGGCCTGTATCC 

TGGGGATTCAACTCTGTGCCTATGTGGCATTGAGCCTGTGT 

GAATGTGGTGACTGTCATGCTGTTTTGCTGTGTGTGCGTCTG 

CATGCCTGTGTGTTTGTGTGTCTCTCCACCTTCGTGGGGGGC 

AACTGTAGGTGTATTATGAGCCTTGGGTCTGTCTGTGTGTA 

CAATAGCAATGTCTGTGCGGACTTAAGGACCTGCGCCCATA 

TGTTTGTGGGACTTTCTGGGCATGCATGCTTGTTTATGAGGC 

CATACATCCGGGTATTCTGTGAACTGCTAGCATGGTGTGTA 

TCTGTGTGGCAGACAGAAAATGGCTGGGTGGGA 


SEQ ID 

NO. 255 


KCP elb 
399912 


ATCTCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACCTGA 

GGTCAGGAGTTCAAGCCCAGCCAGCCCAACATGGCGAAAC 

CCCGTCTCTATTAAAAAATACAAAAAAATTTAGCTGGGCCT 

AGTGGTGGGCGCCTGTAATCCCAGCTACTCCGGAGGCTGAG 

GCAGGAGAATCGCTTGAATCTGGGAGGCAGAGGTTGCAGT 

GAGCAGAGATCGCACCACTGCACTCCAGCCTGGGCAACAG 

AGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAGAAAAAG 

AAAAATGAGAGTGTAAGGGCCCAGAGGGGCTGAGGGCTCC 

TTTCTCCTCCCCAACTCCCTGTCACTAGAAGGTGGGCCCTGC 

CATAGGAGGATTCTGCAGAACCCTCAAGGACCCGCGGAGC 

AGGACGGCACCTTCTTCCCATGACCACCCATTTGGATGTGT 

TTTTCACCCCTTTCTGGGTGGGGCAGACTTTCCCCCTCCCCA 

TGAGTTCAGGCAG[G/T]GGGTTAAATAAGATTTCCCTTGAA 

GTCGAATGAAATCACAATGCACCACACACAGGGACACACA 

CACACACACACGCACGCACGCACATCACACACACACACAC 


SEQ ID 
NO. 256 



-189- 





ACACACACACACACACACACACACATACACACACACAGTC 

TCCCTGGGGCCAATCTACTGCCCCCTGAACCTCACCCATCA 

GCCAGGTGCCTGGCCCCGGGTCTGTCTCTTAGGGTTACATG 

CTCCCGGGGCTCCCGCACATACCCCGGCAGATGAGGGTGCG 

CAGGGGTGAGGGCGCAGGGCTGGGCGTCCCCCGCCCCCAC 

CGTGCAGCCCTCGCCCCCGCCCCGCCCCTCCGTAGTTGCCC 

GCCCGCCGCCCCCTCCGCCGCCCCCTCCGCCGCTCCGACTC 

TCGCCCCGAGCGCTGGCAGCAGGCAGCAGGCAGCAGGCGG 

GCGCGCTGTGGCTCCGCGCCGCGCGGTCCGGGCTCTGTTCA 

TTCATGATTGGTACTCGGCCCTCCGAGACC 




rsl02685 


AGCACTCCTGGGGCTCATTGTTAAGTTTATAAAACTCAGAG 

CTGATGAGTTGTGTGCACTGTGTGGGTCTGAGTGGGCTTAT 

GACTCCCCTCCAAGCCTGGCTGTAAGAATCTAAGACTTAAA 

GCTGAAGGACCAAATGGGACTTTCTGTCCCATCCCCTCTCT 

GCTCCATGCAAGCACCAA[C/T]GTGGATTTTTGCCCCTAATT 

ATATTAGGGAACGCTGTCAATCAAAAAGATGATGTTAAACT 

CATCCAGAACAAACCAAACCATGTTTAAGGGGAAGAAAAG 

ATTACATCTTCAAATGCCAGCATGCCATCATTAATACAATG 

TCTAATGTAGTCAATATAGTTCAGGCAACATTGAAAATGAA 

CCACTGCAAATACTAGGAATACAATTTCAAGAGGAAGCAC 

AACATTCTGTGTTTCTATGCACACAGTCCTGTAAATTATTTG 

CAGCTCAAGTATGTCATGTTCTTTTAAATTTTCCCCTGGGTA 

CAGCTTGAACAACTTCCTACAAGTGTTGATATGTCATATTCT 

CATTATCATTTAGTTCAAAATTACCATGATTTAATTACCATG 

AGGTTGCTTTTTTGATACATGAGTTACTTAGAAATTGAATTA 

ggctaggcatggtggcttccacctataatcctagcactttggaaggccaaggcaggaggattgctt 

gagtttgaggccagtctaggcaatatagtgagacctcatctccccaaaagtacaaaaaaactagcc 

aggcatggggacacatgcctatagttccagctactcaaaggctgaggtggggaggattgctttgag 

cctggg 


SEQ ID 
NO. 257 


rs905808 


GCCAGCTATCCCCAGAGACATCACAGGAGAAGGAGCAGAAGCTGGA 
ACATCATCCGGGAGCTGGACTAGAACGTCCCGGGAAACTTCAGCCT 
GGCTTCTGCTTTGTCCCGAAAACCCAGGGGCTCCAGCTCCAGGGCT 
GTGTCTTAGAATGAGGCAGTTTATCTGTTCAGGGCTTCTCTTAGTT 
TTTAATCCCAATAGGACACA [C/T] GTTGTATTAAAAAGCCATGCG 
AGATGGAAGAAGGAAATTGAATGAAATTTGAGGGCAGGTAGGAGCA 
GAGACAATAAATAATTCAGCAGTGAAGGAAGCAGAAAAAAGATTGC 
ACTCATTTCGCCCTTCAACAATTATACTAAACACCTGCTCTGGGCC 
ACAGAAGGGCCAGATCCCATTCCTGTGCTCAGGAAGCCCACAGGCC 
GGCAGGGAGAGGCTGGTTGGAATGTGTGCTTTGCACTGTAACGGAG 
GCATCGAGCATGGTAAGGGACTGGCGGTGACTGCTGCCTGCGGACG 
TCGAGACAGGGGCCTTTGAAGAGGCAGGACCTGTCTGGAGTCTTAC 
CTGGGCCTTGGCCTGGCAATGGGG 


SEQ ID 

NO. 258 



2345.2049-005 



-190- 



Tablel 1 . The Build 33 location of SNPs and microsatellites employed for the first- 
pass association analysis across KChlPl. 

5 



Start (B33) 


Marker 


Public 
Alias 


deCODE alias 


Variation 


169788696 


DG5S47 








169794522 


DG5S1592 








169843903 


DG5S119 








169869845 


rs933656 


rs933656 


DG00AAFCS 


A/G 


169869955 


rs2339091 


rs2339091 


DG00AAFCI 


G/T 


169961410 


DG5S13 








169964087 


rs905808 


rs905808 


SG05S1212 


C/T 


170006645 


rs883849 


rs883849 


SG05S206 


A/G 


170015858 


DG5S123 








170037283 


rs2 135046 


rs2 135046 


SG05S159 


C/T 


170041996 


DG5S124 








170056955 


rs2339139 


rs2339139 


DG00AAFCR 


A/G 


170064881 


rs329468 


rs329468 


SG05S896 


A/G 


170070041 


rs50057 


rs50057 


SG05S1270 


A/G 


170070735 


rs 102685 


rsl02685 


SG05S905 


C/T 


170073252 


rs50364 


rs50364 


DG00AAFCD 


A/G 


170081292 


KCP_1152 




SG05S176 


C/T i 


170081473 


KCPJ333 




SG05S921 


A/G 


170082789 


KCP_2649 




SG05S923 


C/T 


170085116 


KCP_4976 




SG05S187 


C/T 


170085217 


KCP_5077 




SG05S179 


A/T 


170095540 


KCP_15400 




SG05S946 


C/T 


170096292 


KCP 16152 


rs4868018 


SG05S948 


A/G 


170098209 


KCP 18069 


rsl363712 


SG05S189 


C/T 


170105556 


D5S625 
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2345.2049-005 

-191- 



Table 12. The Build 33 location of SNPs found through sequencing across 
KC hIPl(from exon lb to exon 8). 



Build rOS 


Project 

rOS 


DECODE 

A T T A O 

ALIAS 


SEQ PROJECT 

A T T A O 

ALIAS 


PUBLIC 
ALIAS 


SNP 




yo/7 


SG05S2107 


KCP 9677 


rs6555900 


C/G 


loyoo/4o5 


t A1CC 


SG05S229 


KCP 10355 




A/T 




1 A/t AC 

ll)44o 


rvPAA A A TT A Ti 

DuUUAAHAK 


KCP 10446 




C/G 


ioyo / lio / 


1 AQAH 
14o4 / 


MjUjS4o5 


KCP_ 14847 


rs4867608 


A/T 


i AOQ77 1 OQ 
loyo /ZlZy 


1 fAI A 

iDUiy 


CPACC 1 OAO 

MjUjSizyo 


KCP15019 


rs4867973 


A/G 


ioys /Z4i / 


1 jJU/ 


a(jU5S437 


KCP 15307 




A/C 


1 01/5 /Z4Z1 


1 ^7 1 1 
1331 1 


ovjUjo4Jo 


V/^Tl 1 CI 1 1 

KCP 15311 




A/T 


ioyo /zhjj 


1 jdIj 


CPA^Cyiin 

ouujMjy 


KCP 15325 




C/G 


1 AO ft 7? 0,40 
lOyo i Ayny 


1 ^810 

iDojy 


oVjUjo44U 


KCP_1 joJ9 




A/G 




1 A/170 

i04zy 


C/^A.^Q/1 OA 

oLrlOo4oO 


KCP_lo4Z9 




C/T 


loyo /jOoU 


1 £<7fl 

1D3 /u 


5vjUjl>4o / 


KCP_16570 




A/G 


lOVo /D 1ZJ 


1 OA 1 1 

loUU 


orAcc/i o o 


"Lf/^Tl 1 OA1 1 

KCP 18013 




A/T 


1 AQQ7^£C 

loyo /jjoo 


1 QASQ 
1o4jo 


bvjUjMUUZ 


KCP_lo45o 


rs655590l 


A/G 


lOVo /OjUZ 


1 0 1 go 

iy iyz 


buUDb4oy 


XT' f~^T\ mi AO 

KCP 19192 




A/G 


loyo /oJo!> 


Z1ZD5 


SG05S490 


KCP 21 255 




G/T 


loyo /o /j4 


Zloz4 


SO05S491 


KCP_21624 


rs4867609 


A/G 


loyo /yo /o 


ZZjoo 


b(jr05!S4yz 


KCP 22568 




A/C 


loyo /y / 1 / 


ZZoU / 


Cr , ACP>!A') 

!SvjU5S493 


KCP 22607 




C/T 


ioyooi4yo 


Z43oo 




KCP_24386 




A/G 


loyoozooi 


1j j /I 




KCP 25571 




A/C 


loyoojZoD 


zoIjj 


CrACC/1 AiC 


KCP 26155 


rs744345 1 


A/G 


1 AOftQim 
1 OVoojjjj 


ZOZZJ 


oLrU!)i34y / 


KCP 26223 




C/G 


lOyooj'f 1*? 


ZOJUJ 


cnn^c/ioo 
oLrUjo4y5 


KCP_zo303 




A/G 




ZOj J 3 


O/^A^O 1 1 HI 
1/1 


KCPZ6355 




C/G 


169883518 


26408 


SG05S499 


KCP 2640R 




A/T 


169883738 


26628 


SG05S500 


KCP 26628 




A/G 


169883811 


26701 


SG05S501 


KCP 26701 




A/G 


169884084 


26974 


SG05S1172 


KCP_26974 




C/T 


169884145 


27035 


SG05S502 


KCP 27035 




G/T 


169884439 


27329 


SG05S503 


KCP_27329 | 




C/T 


169884682 


27572 


SG05S504 


KCP 27572 




A/G 


169884707 


27597 


DGOOAAJHT 


KCP 27597 




VG 


169884973 


27863 


SG05S505 


KCP 27863 




VG 


169885005 


27895 


SG05S506 


KCP 27895 




VG 



2345.2049-005 



-192- 



169888453 


31343 


SG05S507 


KCP_31343 


rs4867975 


C/T 


169889433 


32323 


SG05S60 


KCP 32323 




C/T 


169889680 


32570 


SG05S508 


KCP 32570 




A/G 


169890025 


32915 


SG05S509 


KCP 32915 




A/G 


169890055 


32945 


SG05S1173 


KCP 32945 


rs6873409 


A/G 


169890089 


32979 


SG05S1174 


KCP 32979 


rs6873133 


A/C 


169890291 


33181 


SG05S510 


KCP 33181 


rs6873872 


A/G 


169892122 


35012 


SG05S1175 


KCP_35012 




A/C 


169892332 


35222 


SG05S511 


KCP 35222 


rs7724503 


A/G 


169892524 


35414 


SG05S61 


KCP 35414 




G/T 


169892619 


35509 


SG05S512 


KCP 35509 


rs6885463 


C/T 


169892687 


35577 


SG05S513 


KCP 35577 




G/T 


169893157 


36047 


SG05S514 


KCP 36047 


rs6555903 


C/T 


169893169 


36059 


SG05S515 


KCP 36059 


rs6555904 


C/T 


169893871 


36761 


SG05S516 


KCP 36761 




A/C 


169894061 


36951 


SG05S517 


KCP_36951 




A/G 


169894358 


37248 


SG05S518 


KCP 37248 




C/G 


169895507 


38397 


SG05S1176 


KCP 38397 




C/T 


169895699 


38589 


SG05S953 


KCP 38589 




A/C 


169896322 


39212 


SG05S519 


KCP 39212 


rs7737732 


G/T 


169896357 


39247 


SG05S520 


KCP 39247 




A/G 


169896369 


39259 


SG05S521 


KCP 39259 




A/G 


169896451 


39341 


SG05S1177 


KCP_39341 




A/G 


169896647 


39537 


SG05S522 | 


KCP 39537 




C/T 


169896750 


39640 


SG05S523 


KCP 39640 




A/T 


169896914 


39804 


SG05S524 


KCP_39804 




A/G 


169897484 


40374 


SG05S525 


KCP 40374 




C/T 


169897594 


40484 


SG05S526 


KCP 40484 




A/G 


169897621 


40511 


SG05S527 


KCP 40511 




C/T 


169897856 


40746 


SG05S528 


KCP 40746 




C/T 


169898205 


41095 


SG05S529 


KCP 41095 




C/T 


169898252 


41142 


SG05S530 


KCP 41142 




C/T 


169898371 


41261 


SG05S531 


KCP_41261 




A/G 


169899446 


42336 


SG05S532 


KCP 42336 




A/G 


169899693 


42583 


SG05S533 


KCP_42583 




A/G 


169900156 


43046 


SG05S534 


KCP 43046 




A/G 


169900425 


43315 


SG05S1178 


KCP 43315 




Z/G 


169900629 


43519 


SG05S535 


KCP_43519 


< 


Z/T 


169902212 


45102 


SG05S536 


KCP_45102 i 


-s21 12601 - 


VG 



2345.2049-005 



-193- 



169902400 


45290 


SG05S537 


KCP_45290 




G/T 


169903206 


46096 


SG05S538 


KCP 46096 




C/T 


169903615 


46505 


SG05S539 


KCP 46505 




C/T 


169903676 


46566 


SG05S540 


KCP 46566 




A/C 


169903766 


46656 


SG05S541 


KCP_46656 




A/C 


169904530 


47420 


SG05S542 


KCP 47420 




C/T 


169904757 


47647 


SG05S543 


KCP 47647 




A/G 


169906262 


49152 


SG05S1179 


KCP_49152 




A/G 


169906576 


49466 


SG05S544 


KCP_49466 




A/G 


169906846 


49736 


SG05S545 


KCP_49736 




A/T 


169907866 


50756 


SG05S1180 


KCP 50756 




A/G 


169908937 


51827 


SG05S1181 


KCP 51827 




C/T 


169909190 


52080 


SG05S1182 


KCP 52080 




C/T 


169910099 


52989 


SG05S546 


KCP 52989 




A/G 


169910133 


53023 


SG05S547 


KCP 53023 




C/T 


169911784 


54674 


SG05S548 


KCP 54674 




A/C 


169911823 


54713 


SG05S549 


KCP_54713 




A/C 


169913086 


55976 


SG05S1183 


KCP 55976 




A/G 


169913415 


56305 


SG05S62 


KCP 56305 




A/G 


169913670 


56560 


SG05S954 


KCP_56560 




C/T 


169913988 


56878 


SG05S550 


KCP_56878 




C/G 


169914731 


57621 


SG05S551 


KCP_57621 




A/G 


169914887 


57777 


SG05S552 


KCP_57777 




A/G 


169915597 


58487 


SG05S553 


KCP_58487 




A/G 


169917130 


60020 


SG05S554 


KCP 60020 




C/T 


169917579 


60469 


SG05S555 


KCP 60469 




A/G 


169917813 


60703 


SG05S556 


KCP 60703 




A/G 


169919206 


62096 


SG05S557 


KCP_62096 




A/G 


169919909 


62799 


SG05S233 


KCP_62799 




C/T 


169921008 


63898 


SG05S558 


KCP_63898 




A/G 


169921407 


64297 


SG05S559 


KCP 64297 




A/G 


169921917 


64807 


SG05S560 


KCP 64807 




G/T 


169922010 


64900 


SG05S1184 


KCP 64900 




A/G 


169922309 


65199 


SG05S955 


KCP 65199 




A/G 


169922397 


65287 


SG05S561 


KCP 65287 




G/T 


169923449 


66339 


SG05S562 


KCP 66339 




A/G 


169923611 


66501 


SG05S563 


KCP 66501 




A/G 


169924005 


66895 


SG05S564 


ECCP 66895 




A/G 


169925422 


68312 


SG05S956 


tCCP 68312 




A/C 



2345.2049-005 



-194- 



169926039 


68929 


SG05S565 


KCP_68929 




C/T 


169926454 


69344 


SG05S566 


KCP 69344 




A/G 


169926756 


69646 


SG05S567 


KCP_69646 




C/T 


169927013 


69903 


SG05S568 


KCP_69903 




A/G 


169927893 


70783 


SG05S569 


KCP 70783 




C/T 


169928063 


70953 


SG05S570 


KCP 70953 




A/T 


169928076 


70966 


SG05S571 


KCP 70966 




AJC 


169928444 


71334 


SG05S572 


KCP_71334 




C/T 


169928522 


71412 


SG05S573 


KCP 71412 




A/T 


169928555 


71445 


SG05S1185 


KCP 71445 




C/T 


169928665 


71555 


SG05S1186 


KCP_71555 




C/T 


169928700 


71590 


SG05S1187 


KCP 71590 




C/T 


169929635 


72525 


SG05S574 


KCP 72525 


rs4269297 


A/G 


169929849 


72739 


SG05S575 


KCP_72739 




C/G 


169930171 


73061 


SG05S576 


KCP 73061 


rs4867613 


C/T 


169930506 


73396 


SG05S577 


KCP 73396 




A/T 


169930538 


73428 


SG05S578 


KCP 73428 


rs4867978 


A/G 


169930644 


73534 


SG05S579 


KCP 73534 


rs4867979 


C/T 


169931073 


73963 


SG05S580 


KCP 73963 




C/G 


169931425 


74315 


SG05S581 


KCP_74315 




A/G 


169931663 


74553 


SG05S582 


KCP 74553 




G/T 


169931670 


74560 


SG05S583 


KCP_74560 




C/T 


169932137 


75027 


SG05S584 


KCP_75027 




C/T 


169932696 


75586 


SG05S585 


KCP_75586 


rs7723669 


A/C 


169932998 


75888 


SG05S586 


KCP 75888 




C/T 


169933181 


76071 


SG05S587 


KCP 76071 


rs386758 


A/G 


169933212 


76102 


SG05S588 


KCP 76102 


rs386759 


C/T 


169933256 


76146 


SG05S589 


KCP 76146 




A/G 


169933389 


76279 


SG05S1188 


KCP_76279 


rs4368746 


C/T 


169933420 


76310 


SG05S590 


KCP 76310 




C/T 


169933699 


76589 


SG05S591 


KCP_76589 




C/T 


169933756 


76646 


SG05S592 


KCP_76646 




C/T 


169934348 


77238 


SG05S593 


KCP_77238 


i 


C/T 


169934429 


77319 


SG05S594 


KCP 77319 


( 


C/G 


169934556 


77446 


SG05S595 


KCP_77446 


< 


C/T 


169934663 


77553 


SG05S596 


fCCP_77553 


( 


C/T 


169934751 


77641 . 


5G05S597 


fCCP_77641 i 


-S4242157 t 


VG 


169934936 


77826 : 


3G05S598 


fCCP_77826 


( 


:/g 


169934949 


77839 ! 


5G05S599 


SCCP_77839 i 


•S7735198 / 


VG 



2345.2049-005 



-195- 



169935134 


78024 


SG05S600 


KCP_78024 


rs4867981 


A/G 


169935240 


78130 


SG05S601 


KCP_78130 


rs4867614 


C/T 


169935254 


78144 


SG05S602 


KCP 78144 




A/C 


169935713 


78603 


SG05S603 


KCP_78603 




C/T 


169935892 


78782 


SG05S604 


KCP 78782 




A/G 


169935939 


78829 


SG05S605 


KCP_78829 




A/G 


169935989 


78879 


SG05S606 


KCPJ78879 




C/T 


169936272 


79162 


SG05S607 


KCP 79162 




C/T 


169936275 


79165 


SG05S608 


KCP 79165 




C/T 


169936329 


79219 


SG05S609 


KCP_79219 




G/T 


169936495 


79385 


SG05S610 


KCP_79385 


rs6876518 


C/T 


169936910 


79800 


SG05S61 1 


KCP 79800 




C/G 


169937029 


79919 


SG05S1189 


KCP 79919 




A/G 


169937270 


80160 


SG05S612 


KCP_80160 




A/G 


169937896 


80786 


SG05S613 


KCP 80786 




A/G 


169938126 


81016 


SG05S614 


KCP 81016 




C/T 


169938400 


81290 


SG05S615 


KCP 81290 




A/G 


169938894 


81784 


SG05S1190 


KCP_81784 




A/G 


169939578 


82468 


SG05S957 


KCP 82468 


rs4242158 


A/G 


169940311 


83201 


SG05S616 


KCP_83201 




C/T 


169940995 


83885 


SG05S617 


KCP_83885 




A/G 


169941106 


83996 


SG05S618 


KCP_83996 


l:s4867615 


A/G 


169941897 


84787 


SG05S1191 


KCP_84787 




A/T 


169942667 


85557 


SG05S619 


KCP_85557 




A/G 


169942775 


85665 


SG05S620 


KCP 85665 


rs6892193 


C/T 


169942903 


85793 


SG05S958 


KCP_85793 


rs6892514 


C/T 


169943046 


85936 


SG05S621 


KCP 85936 




A/G 


169943817 


86707 


SG05S622 


KCP_86707 




A/T 


169944237 


87127 


SG05S623 


KCP 87127 


rs6881347 


C/G 


169945487 


88377 


SG05S624 


KCP_88377 




C/T 


169945857 


88747 


SG05S625 


KCP_88747 




A/T 


169945886 


88776 


SG05S626 


KCP 88776 




C/T 


169945923 


88813 


SG05S627 


KCP 88813 ! 




A/G 


169946380 ! 


89270 


SG05S628 


KCP_89270 




A/G 


169946491 


89381 


SG05S629 


KCP_89381 


rs4867983 


A/G 


169947228 


90118 


SG05S630 


KCP_90118 




MG 


169947236 


90126 


SG05S631 


KCP_90126 




3/T 


169947285 


90175 . 


SG05S632 


KCP_90175 




ZTT 


169947471 


90361 ! 


5G05S633 


KCP 90361 


< 


Z/G 



2345.2049-005 



-196- 



169947529 


90419 


SG05S634 


KCP 90419 




C/T 


169947661 


90551 


SG05S635 


KCP 90551 




A/G 


169947834 


90724 


SG05S636 


KCP 90724 




A/G 


169948187 


91077 


SG05S637 


KCP_91077 


rs6874152 


A/G 


169948683 


91573 


SG05S1192 


KCP 91573 




A/G 


169948703 


91593 


SG05S1193 


KCP 91593 




G/T 


169948722 


91612 


SG05S1194 


KCP 91612 




A/G 


169948755 


91645 


SG05S1195 


KCP_91645 




C/T 


169948788 


91678 


SG05S1196 


KCP_91678 




A/G 


169948798 


91688 


SG05S1197 


KCP_91688 




C/T 


169948977 


91867 


SG05S638 


KCP 91867 




C/T 


169949063 


91953 


SG05S639 


KCP 91953 




C/T 


169949229 


92119 


SG05S640 


KCP 92119 




C/T 


169949277 


92167 


SG05S641 


KCP 92167 




A/T 


169949352 


92242 


SG05S642 


KCP 92242 




A/G 


169949354 


92244 


SG05S643 


KCP 92244 


rs4867984 


A/G 


169949449 


92339 


SG05S644 


KCP 92339 




C/T 


169950146 


93036 


SG05S63 


KCP 93036 




A/G 


169950148 


93038 


SG05S645 


KCP 93038 




A/G 


169950333 


93223 


SG05S646 


KCP 93223 


rs4867985 


C/T 


169950655 


93545 


SG05S64 


KCP 93545 




G/T 


169950703 


93593 


SG05S1198 


KCP_93593 




C/G 


169950754 


93644 


SG05S654 


KCP_93644 




G/T 


169950844 


93734 


SG05S655 


KCP_93734 




C/T 


169950855 


93745 


SG05S656 


KCP 93745 




G/T 


169950892 


93782 


SG05S1199 


KCP 93782 




C/G 


169950990 


93880 


SG05S657 


KCP 93880 




C/T 


169951245 


94135 


SG05S1200 


KCP 94135 




A/C 


169951290 


94180 


SG05S1201 


KCP 94180 




A/G 


169951422 


94312 


SG05S658 


KCP 94312 




A/T 


169951577 


94467 


SG05S659 


KCP 94467 




A/G 


169951689 


94579 


SG05S660 


KCP 94579 




A/G 


169951702 


94592 


SG05S661 


KCP 94592 




A/G 


169951831 


94721 


SG05S662 


KCP 94721 


( 


C/G 


169951838 


94728 


SG05S663 


KCP 94728 


i 


A/G 


169951848 


94738 


SG05S664 


KCP_94738 


< 


err 


169951855 


94745 


SG05S665 


ECCP_94745 


i 


VG 


169952144 


95034 


SG05S1202 


K£P_95034 


1 


VG 


169952209 ! 


95099 


SG05S666 


fCCP 95099 


I 


vc 



2345.2049-005 



-197- 



169952705 


95595 


SG05S667 


KCP_95595 




A/G 


169952838 


95728 


SG05S670 


KCP_95728 




A/G 


169952962 


95852 


SG05S671 


KCP_95852 




A/G 


169953175 


96065 


SG05S672 


KCP_96065 




C/G 


169953185 


96075 


SG05S673 


KCP_96075 


rs4354060 


A/G 


169953207 


96097 


SG05S674 


KCP_96097 


rs4374772 


C/G 


169953297 


96187 


SG05S675 


KCP_96187 




A/G 


169953327 


96217 


SG05S676 


KCP_96217 




A/G 


169953334 


96224 


SG05S677 


KCP 96224 




A/G 


169953426 


96316 


SG05S678 


KCP_96316 


rs6862741 


A/G 


169953728 


96618 


SG05S1203 


KCP_96618 




C/G 


169953902 


96792 


SG05S679 


KCP_96792 


rs4867987 


C/T 


169954134 


97024 


SG05S680 


KCP_97024 


rs4867988 


C/T 


169954165 


97055 


SG05S1204 


KCP_97055 


rs4867989 


C/T 


169954260 


97150 


SG05S1205 


KCP 97150 




A/G 


169954800 


97690 


SG05S681 


KCP 97690 


rs6868698 


A/T 


169954954 


97844 


DG00AAJIA 


KCP_97844 


rs2202438 


A/T 


169955450 


98340 


SG05S682 


KCP_98340 




C/T 


169956638 


99528 


SG05S683 


KCP_99528 




A/C 


169956932 


99822 


SG05S684 


KCP_99822 




C/T 


169957089 


99979 


SG05S685 


KCP_99979 




A/G 


169957538 


100428 


SG05S1206 


KCP_1 00428 




G/T 


169958211 


101101 


SG05S1207 


KCPJ01101 


rs4495201 


A/G 


169958651 


101541 


SG05S1208 


KCP_101541 




A/G 


169958784 


101674 


SG05S686 


KCP 101674 




A/C 


169959085 


101975 


SG05S687 


KCPJ01975 




A/G 


169959172 


102062 


SG05S1209 


KCP 102062 




A/T 


169959537 


102427 


SG05S688 


KCP 102427 




A/G 


169959561 


102451 


SG05S1210 


KCP_1 02451 




C/T 


169959860 


102750 


SG05S1211 


KCP 102750 




C/T 


169959992 


102882 


DGOOAAJIB 


KCP 102882 




C/T 


169961135 


104025 


SG05S689 


KCP_1 04025 


rs4867990 


A/G 


169961268 


104158 


SG05S690 


KCP 104158 




G/T 


169961404 


104294 


SG05S691 


KCP_104294 


rs4867991 


A/G 


169961971 


104861 


SG05S692 


KCP_1 04861 




A/G 


169962144 


105034 


SG05S693 


KCP 105034 




A/G 


169962410 


105300 


SG05S694 


KCP 105300 


rs4242159 - 


AT 


169962429 


105319 


SG05S695 


KCP 105319 i 


rs4428429 < 


ZIG 


169962889 


105779 


SG05S696 


KCP_1 05779 




VG 



2345.2049-005 



-198- 



169962929 


105819 


SG05S697 


KCP 105819 




C/T 


169963467 


106357 


SG05S698 


KCP 106357 


rs4867990 


A/G 


169963592 


106482 


SG05S699 


KCP 106482 




C/T 


169963741 


106631 


SG05S700 


KCP 106631 




A/G 


169963761 


106651 


SG05S701 


KCP 106651 




A/G 


169963827 


106717 


SG05S702 


KCP_106717 




A/T 


169964021 


106911 


SG05S703 


KCP 106911 


rs905807 


C/G 


169964087 


106977 


SG05S1212 


KCP_1 06977 


rs905808 


C/T 


169964112 


107002 


SG05S1213 


KCP_ 107002 


rs905809 


C/T 


169964368 


107258 


SG05S988 


KCP 107258 


rs905811 


A/G 


169964490 


107380 


DG00AAJIC 


KCP 107380 




A/G 


169964862 


107752 


SG05S705 


KCP 107752 


rs905812 


A/T 


169964998 


107888 


SG05S706 


KCP 107888 




A/T 


169965204 


108094 


SG05S707 


KCP 108094 




C/T 


169965210 


108100 


SG05S708 


KCP 108100 




C/T 


169965293 


108183 


SG05S709 


KCP 108183 




C/T 


169965384 


108274 


SG05S710 


KCP 108274 




C/T 


169965778 


108668 


SG05S1214 


KCP 108668 




C/T 


169965813 


108703 


SG05S230 


KCP 108703 




G/T 


169965814 


108704 


SG05S711 


KCP 108704 




A/G 


169965989 


108879 


SG05S712 


KCP 108879 




A/T 


169966345 


109235 


SG05S713 


KCP 109235 




C/G 


169966790 


109680 


SG05S714 


KCP 109680 




A/C 


169966813 


109703 


SG05S715 


KCP 109703 


rs6877169 


A/G 


169966833 


109723 


SG05S716 


KCP 109723 




A/G 


169966856 


109746 


SG05S718 


KCP_109746 


rs905813 


A/G 


169967196 


1 10086 


SG05S719 


KCP 110086 




C/T 


169967509 


1 10399 


SG05S720 


KCP 110399 




C/G 


169968134 


111024 


SG05S721 


KCP 111024 




A/C 


169968258 


111148 


SG05S722 


KCP_111148 


rs7726675 


C/T 


169968588 


111478 


SG05S723 


KCP 111478 


rs2089191 


C/G 


169968602 


111492 


SG05S724 


KCP_1 11492 




A/G 


169968614 


111504 


SG05S725 


KCP 111504 




C/G 


169969010 


111900 


SG05S726 


KCP 111900 




MG 


169969185 


1 12075 


SG05S727 


KCP_1 12075 


i 


VG 


169969769 


112659 


SG05S728 


KCP_1 12659 


rs4867994 ( 


211 


169970341 


113231 


SG05S729 


KCP_113231 


i 


MG 


169970367 


113257 


SG05S730 


KCP 113257 


rs4867616 , 


KIG 


169970440 


113330 ! 


SG05S733 


KCP 113330 




MG 



2345.2049-005 



-199- 



169971048 


113938 


SG05S734 


KCP_1 13938 




A/G 


169971464 


114354 


SG05S736 


KCP 114354 




A/G 


169971531 


114421 


SG05S1215 


KCP 114421 




C/T 


169971568 


114458 


SG05S737 


KCP_1 14458 


rs2879337 


C/T 


169971621 


114511 


SG05S738 


KCP 114511 




C/T 


169972209 


115099 


SG05S740 


KCP 115099 


rsl 553537 


A/G 


169972598 


115488 


SG05S741 


KCP 115488 


rs6870612 


C/G 


169973254 


116144 


SG05S742 


KCP 116144 


rsl013922 


C/T 


169973325 


116215 


SG05S743 


KCP_1 16215 




A/G 


169973369 


116259 


SG05S744 


KCP_1 16259 




A/G 


169973465 


116355 


SG05S745 


KCP 116355 


rs2089192 


A/G 


169974479 


117369 


SG05S746 


KCP_1 17369 


rs870109 


A/T 


169974926 


117816 


SG05S747 


KCP 117816 


rsl 553538 


C/T 


169976065 


118955 


SG05S1216 


KCP 118955 




C/T 


169977940 


120830 


SG05S748 


KCP 120830 


rs905819 


C/T 


169978197 


121087 


SG05S749 


KCP 121087 




C/T 


169978247 


121137 


SG05S192 


KCP_121137 




A/G 


169978339 


121229 


SG05S193 


KCP 121229 




C/T 


169978427 


121317 


SG05S1217 


KCP 121317 




C/T 


169980304 


123194 


SG05S751 


KCP_123194 




A/G 


169980403 


123293 


SG05S752 


KCP_1 23293 




A/G 


169980481 


123371 


SG05S1218 


KCP_123371 




A/G 


169980664 


123554 


SG05S753 


KCP_123554 




C/T 


169981035 


123925 


SG05S1219 


KCPJ23925 




A/G 


169981067 


123957 


SG05S754 


KCP 123957 




A/G 


169981628 


124518 


SG05S755 


KCP_124518 




C/T 


169981632 


124522 


SG05S756 


KCP 124522 




G/T 


169981987 


124877 


SG05S194 


KCP 124877 


rs4146511 


C/T 


169982473 


125363 


SG05S757 


KCP 125363 


rs2202436 


A/T 


169982868 


125758 


SG05S758 


KCP_125758 




C/T 


169983196 


126086 


SG05S195 


KCP 126086 


rs2202437 


A/G 


169983318 


126208 


DG00AAJHA 


KCP 126208 




T/C 


169983565 


126455 


SG05S1220 


KCP 126455 




C/G 


169983591 


126481 


SG05S759 


KCP 126481 


rs2221441 


C/G 


169983692 


126582 


SG05S760 


KCP_ 126582 




A/G 


169985824 


128714 


SG05S1221 


KCP 128714 




A/G 


169985916 


128806 


SG05S151 


KCP 128806 


i 


A/G 


169985985 


128875 


SG05S761 


KCP_1 28875 


< 


C/T 


169986162 


129052 


5G05S763 


KCP 129052 


rs4867617 ( 


C/G 



2345.2049-005 



-200- 



169986174 


129064 


SG05S762 


KCP_129064 




C/G 


169986189 


129079 


SG05S764 


KCP 129079 


rs4867618 


C/T 


169986203 


129093 


SG05S152 


KCP 129093 


rs4867995 


C/G 


169986237 


129127 


SG05S480 


KCP 129127 


rs4867619 


A/G 


169986334 


129224 


SG05S765 


KCP 129224 


rs486762 


G/T 


169986478 


129368 


SG05S766 


KCP 129368 




C/G 


169986579 


129469 


SG05S181 


KCP_ 129469 




A/G 


169986800 


129690 


SG05S182 


KCP_1 29690 


rs4867996 


G/T 


169986957 


129847 


SG05S767 


KCP 129847 


rs4867997 


A/G 


169986984 


129874 


SG05S985 


KCP_129874 


rs4867998 


A/C 


169986999 


129889 


SG05S986 


KCP_1 29889 


rs4867999 


A/G 


169987419 


130309 


DG00AAJHB 


KCP 130309 




A/G 


169987667 


130557 


SG05S196 


KCP 130557 


rs905822 


C/G 


169988155 


131045 


SG05S768 


KCP_131045 




A/G 


169988354 


131244 


SG05S197 


KCP 131244 


rs905824 


A/G 


169988368 


131258 


SG05S769 


KCP 131258 


rs905825 


C/T 


169988581 


131471 


SG05S770 


KCP 131471 


rs905826 


A/G 


169988714 


131604 


SG05S1222 


KCP 131604 




A/G 


169988812 


131702 


SG05S771 


KCP 131702 


rs905827 


C/T 


169988905 


131795 


SG05S65 


KCP_131795 


rs4868001 


C/T 


169988964 


131854 


SG05S153 


KCP 131854 


rs6861734 


G/T 


169989037 


131927 


SG05S772 


KCP_131927 


rs6865908 


A/G 


169989257 


132147 


SG05S773 


KCP_132147 




C/T 


169989533 


132423 


SG05S774 


KCP_132423 




A/G 


169989704 


132594 


SG05S775 


KCP 132594 


rs4868002 


G/T 


169989739 


132629 


SG05S776 


KCP_132629 




A/G 


169989787 


132677 


SG05S154 


KCP 132677 




A/G 


169990284 


133174 


SG05S777 


KCP_133174 




C/T 


169990366 


133256 


SG05S1223 


KCP_1 33256 




A/G 


169990548 


133438 


SG05S778 


KCP 133438 


rs4867621 


A/G 


169990840 


133730 


SG05S779 


KCP 133730 




C/T 


169990962 


133852 


SG05S780 


KCP 133852 




A/G 


169991155 


134045 


SG05S198 I 


KCP 134045 




C/T 


169991415 


134305 


SG05S199 


KCP_1 34305 


rs7737768 


C/T 


169991521 


134411 


SG05S781 


KCP 134411 


rs65 55907 


C/T 


169991729 


134619 


SG05S1224 


KCPJ34619 


i 


MC 


169991939 


134829 


SG05S782 


KCP_1 34829 


( 


C/T 


169992076 


134966 


SG05S783 


KCP 134966 


i 


VG 


169992155 


135045 . 


3G05S784 


KCP 135045 


l 


VG 



2345.2049-005 



-201- 



169992628 


135518 


SG05S200 


KCP 135518 


rs4868003 


G/T 


169992821 


135711 


SG05S785 


KCPJ35711 




G/T 


169993032 


135922 


SG05S786 


KCP 135922 




A/G 


169993096 


135986 


SG05S183 


KCP_1 35986 




A/G 


169993146 


136036 


SG05S481 


KCP 136036 




A/C 


169993585 


136475 


SG05S787 


KCP_136475 




C/T 


169994082 


136972 


SG05S201 


KCP 136972 


rs4868004 


A/G 


169994770 


137660 


SG05S202 


KCP 137660 




A/G 


169995924 


138814 


SG05S788 


KCP_138814 




C/T 


169997343 


140233 


SG05S789 


KCP 140233 




C/T 


169997640 


140530 


SG05S1225 


KCP_140530 




A/G 


169998201 


141091 


SG05S1226 


KCP 141091 




A/G 


170000256 


143146 


SG05S1227 


KCP 143146 


rs953601 


C/T 


170000611 


143501 


SG05S1228 


KCP_143501 




C/T 


170000722 


143612 


SG05S66 


KCP 143612 


rs4867622 


A/G 


170000869 


143759 


SG05S790 


KCP 143759 




C/T 


170000983 


143873 


SG05S1229 


KCP 143873 




C/T 


170001571 


144461 


SG05S1230 


KCP_144461 




C/T 


170001578 


144468 


SG05S1299 


KCP_144468 


rs931805 


C/T 


170002070 


144960 


SG05S203 


KCP_144960 


rs2279873 


C/T 


170002435 


145325 


SG05S791 


KCP 145325 


rs6891256 


C/T 


170002801 


145691 


SG05S1231 


KCP_145691 




A/G 


170003438 


146328 


SG05S792 


KCP_1 46328 




A/G 


170003572 


146462 


SG05S793 


KCP_146462 




G/T 


170003856 


146746 


SG05S482 


KCP_146746 




C/T 


170003940 


146830 


SG05S1232 


KCP 146830 




C/T 


170004075 


146965 


SG05S794 


KCP_146965 




C/T 


170004199 


147089 


SG05S1233 


KCP_1 47089 




C/G 


170004733 


147623 


SG05S204 


KCP 147623 


rs2292146 


C/T 


170005151 


148041 


SG05S795 


KCP 148041 




C/T 


170006326 


149216 


SG05S205 


KCP 149216 


rs6555908 


A/G 


170006485 


149375 


SG05S796 


KCP 149375 


rs883848 


G/T 


170006645 


149535 


SG05S206 


KCP_149535 


rs883849 


A/G 


170006910 


149800 


SG05S1234 


KCP_149800 




A/G 


170007023 


149913 


SG05S797 


KCP_149913 


rs4867623 


211 


170007516 


150406 , 


3G05S798 


KCP_1 50406 


rs4868005 ( 


3/T 


170007640 


150530 . 


3G05S987 


K£P_1 50530 


( 


2/1 


170007808 


150698 ! 


3G05S799 


KCP_1 50698 


( 


3/T 


170007921 


150811 ! 


3G05S155 ] 


fCCP_150811 


L 


VG 



2345.2049-005 



-202- 



170008215 


151105 


SG05S800 


KCP_151105 




G/T 


170008937 


151827 


SG05S801 


KCP_151827 


rs2339094 


A/G 


170009218 


152108 


SG05S1235 


KCPJ52108 




A/G 


170009587 


152477 


SG05S802 


KCP_1 52477 




C/T 


170009592 


152482 


SG05S803 


KCP 152482 




A/C 


170010385 


153275 


SG05S1236 


KCP 153275 


rs6866371 


C/T 


170010518 


153408 


SG05S1237 


KCP 153408 




C/T 


170010943 


153833 


SG05S804 


KCP_1 53833 




C/T 


170011041 


153931 


DG00AAJHC 


KCP_1 53931 


rs2879338 


A/G 


170011269 


154159 


SG05S805 


KCP_154159 




A/G 


170011475 


154365 


SG05S1238 


KCP_1 54365 




A/G 


170011963 


154853 


SG05S806 


KCP 154853 




C/T 


170012367 


155257 


SG05S807 


KCP 155257 




C/G 


170013726 


156616 


SG05S808 


KCP_156616 




C/T 


170013842 


156732 


SG05S207 


KCP_1 56732 


rs924876 


A/T 


170015154 


158044 


SG05S809 


KCP_1 58044 




A/G 


170015582 


158472 


SG05S810 


KCP_1 58472 




C/T 


170015603 


158493 


SG05S811 


KCP_1 58493 




A/G 


170015680 


158570 


SG05S812 


KCP 158570 




C/T 


170015727 


158617 


SG05S67 


KCP 158617 


rs2036559 


C/T 


170016200 


159090 


SG05S813 


KCP 159090 


rs6889236 


A/G 


170016255 


159145 


SG05S814 


KCP 159145 




A/G 


170016259 


159149 


SG05S815 


KCP_1 59149 




C/T 


170016791 


159681 


SG05S1239 


KCP_159681 




A/G 


170016798 


159688 


SG05S1240 


KCP 159688 




A/G 


170017255 


160145 


SG05S208 


KCP_160145 




A/G 


170017524 


160414 


SG05S816 


KCP 160414 




G/T 


170018297 


161187 


SG05S817 


KCP_161187 




A/G 


170018356 


161246 


SG05S818 


KCP_161246 




C/G 


170018549 


161439 


SG05S819 


KCP_161439 




A/G 


170018573 


161463 


SG05S820 


KCP 161463 




C/T 


170019258 


162148 


SG05S821 


KCP_162148 




C/T 


170019314 


162204 


SG05S1241 


KCP 162204 




A/C 


170019379 


162269 


SG05S822 


KCP 162269 




A/T 


170019414 


162304 


SG05S823 


KCP 162304 




C/G 


170019958 


162848 


SG05S824 


KCP 162848 




C/G 


170020197 


163087 


SG05S825 


KCP 163087 


rs6871693 


C/G 


170020606 


163496 


SG05S826 


KCP 163496 




A/G 


170020870 


163760 


SG05S827 


KCP 163760 


i 


A/G 



2345.2049-005 



-203- 



170021444 


164334 


SG05S1242 


KCP_1 64334 




A/G 


170022007 


164897 


SG05S209 


KCP_1 64897 




A/G 


170022125 


165015 


SG05S828 


KCP_165015 




G/T 


170022343 


165233 


SG05S1243 


KCP_1 65233 




C/T 


170022545 


165435 


SG05S1244 


KCP_165435 




C/T 


170023275 


166165 


SG05S829 


KCP 166165 




A/G 


170024034 


166924 


SG05S1245 


KCP 166924 


rs4867624 


C/T 


170024668 


167558 


SG05S830 


KCP_167558 




A/G 


170025753 


168643 


SG05S1246 


KCP_1 68643 




A/G 


170025970 


168860 


SG05S1247 


KCP_1 68860 


rs2202439 


C/G 


170026021 


168911 


SG05S1248 


KCP_168911 




A/G 


170026162 


169052 


SG05S1249 


KCP 169052 




A/G 


170026344 


169234 


SG05S156 


KCP 169234 




A/G 


170028032 


170922 


SG05S1297 


KCP_1 70922 


rs4868008 


A/C 


170028055 


170945 


SG05S831 


KCP 170945 




C/G 


170028163 


171053 


SG05S1250 


KCP 171053 


rs4868009 


A/G 


170028303 


171193 


SG05S1300 


KCP_171193 


rs4868010 


G/T 


170028752 


171642 


SG05S1251 


KCP_171642 




G/T 


170028987 


171877 


SG05S832 


KCP_171877 




A/G 


170030482 


173372 


SG05S833 


KCP 173372 




A/G 


170030815 


173705 


SG05S834 


KCP_1 73705 




C/T 


170030958 


173848 


SG05S210 


KCP 173848 




A/G 


170030986 


173876 


SG05S1252 


KCP_1 73876 




C/T 


170031092 


173982 


SG05S157 


KCP 173982 


rs6875696 


A/C 


170031149 


174039 


SG05S835 


KCP_1 74039 




C/T 


170031150 


174040 


SG05S836 


KCP_1 74040 




A/G 


170031353 


174243 


DG00AAJHF 


KCP 174243 




A/G 


170031709 


174599 


SG05S837 


KCP_1 74599 




C/T 


170031812 


174702 


SG05S838 


KCP 174702 




C/T 


170031962 


174852 


SG05S839 


KCP_1 74852 




A/G 


170031972 


174862 


SG05S840 


KCP 174862 


rs4628005 


G/T 


170032216 


175106 


SG05S158 


KCP 175106 


rs2339095 


C/G 


170032280 


175170 


SG05S211 


KCP 175170 


rs6555910 


A/G 


170032361 


175251 


SG05S841 


KCP_1 75251 




C/T 


170032362 


175252 


DGOOAAJHG 


KCP 175252 


rs7721722 


A/G 


170032610 


175500 


SG05S842 


KCP_1 75500 


i 


VG 


170032814 


175704 


SG05S843 


KCP 175704 


i 


VG 


170033021 


175911 


5G05S844 


KCP_175911 


l 


VG 


170033923 


176813 . 


5G05S845 


KCPJ76813 


l 


VG 



2345.2049-005 



-204- 



170033946 


176836 


DG00AAJHH 


KCP_1 76836 




A/G 


170034620 


177510 


SG05S184 


KCP 177510 


rs4868011 


A/C 


170034720 


177610 


SG05S1253 


KCP_177610 




G/T 


170034980 


177870 


SG05S846 


KCP_1 77870 




G/T 


170035009 


177899 


SG05S847 


KCP 177899 


rs4868012 


C/T 


170036929 


179819 


SG05S848 


KCP 179819 




C/T 


170037010 


179900 


SG05S1254 


KCP 179900 




G/T 


170037283 


180173 


SG05S159 


KCP 180173 


rs2 135046 


C/T 


170037347 


180237 


SG05S212 


KCP 180237 


rs2135047 


C/G 


170038967 


181857 


SG05S1255 


KCP 181857 




C/T 


170039237 


182127 


SG05S1256 


KCP 182127 




C/T 


170039419 


182309 


SG05S849 


KCP 182309 




A/T 


170041190 


184080 


SG05S160 


KCP 184080 


rs2292147 


C/G 


170041385 


184275 


SG05S964 


KCP 184275 




A/G 


170042689 


185579 


DGOOAAJDX 


KCP 185579 




C/A 


170043158 


186048 


SG05S213 


KCP 186048 




A/G 


170043789 


186679 


SG05S161 


KCP 186679 




C/G 


170043953 


186843 


SG05S850 


KCP 186843 




A/C 


170043997 


186887 


SG05S965 


KCP_1 86887 




C/T 


170044226 


187116 


DGOOAAJDY 


KCP 187116 




A/G 


170044277 


187167 


SG05S851 


KCP 187167 




C/G 


170044368 


187258 


SG05S162 


KCP 187258 




G/T 


170044661 


187551 


SG05S853 


KCP 187551 




A/G 


170044798 


187688 


DGOOAAJDZ 


KCP 187688 




T/A 


170044904 


187794 


SG05S966 


KCP_1 87794 




C/T 


170045075 


187965 


SG05S967 


KCP_1 87965 




C/T 


170046043 


188933 


SG05S968 


KCP 188933 




C/T 


170046441 


189331 


SG05S214 


KCP_1 89331 




A/G 


170047120 


190010 


SG05S854 


KCP 190010 


rs2221442 


A/G 


170047129 


190019 


SG05S855 


KCP 190019 




C/G 


170048070 


190960 


SG05S856 


KCP 190960 




C/G 


170048074 


190964 


SG05S857 


KCP 190964 




C/T 


170048090 


190980 


SG05S858 


KCP_1 90980 




C/G 


170048315 


191205 


SG05S859 


KCP 191205 


rs4868015 


C/T 


170048733 


191623 


SG05S860 


KCP 191623 




\JG 


170049238 


192128 


SG05S990 


KCP 192128 


< 


C/T 


170049852 


192742 


DGOOAAJEB 


KCP 192742 


rsl 973529 ' 


r/c 


170050303 


193193 


DGOOAAJEC 


KCP 193193 


( 


3/A 


170051066 


193956 


SG05S163 


KCP 193956 i 


rs2202440 < 


C/T 



2345.2049-005 
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170051438 


194328 


SG05S861 


KCP_1 94328 




A/T 


170051462 


194352 


SG05S862 


KCP_1 94352 




A/G 


170051726 


194616 


DG00AAJEE 


KCP 194616 


rs2036560 


T/C 


170051899 


194789 


SG05S970 


KCP_1 94789 




C/T 


170052012 


194902 


SG05S863 


KCP_1 94902 




A/G 


170052171 


195061 


SG05S971 


KCP_1 95061 




G/T 


170052988 


195878 


SG05S864 


KCP 195878 




C/T 


170053658 


196548 


DGOOAAJEF 


KCP_196548 




A/G 


170053669 


196559 


SG05S865 


KCP_1 96559 


rs7702368 


A/G 


170053840 


196730 


SG05S866 


KCP 196730 




G/T 


170053939 


196829 


SG05S867 


KCP 196829 




C/G 


170054581 


197471 


SG05S972 


KCP 197471 




A/G 


170054620 


197510 


SG05S973 


KCP 197510 




C/T 


170054788 


197678 


DGOOAAJEG 


KCP 197678 


rs962804 


T/C 


170054803 


197693 


SG05S884 


KCP_197693 




A/G 


170054885 


197775 


DGOOAAJEH 


KCP 197775 




C/T 


170055781 


198671 


DGOOAAJEI 


KCP_1 98671 




A/G 


170055957 


198847 


SG05S974 


KCP_1 98847 




A/G 


170056043 


198933 


DGOOAAJEJ 


KCP 198933 




G/A 


170056137 


199027 


SG05S975 


KCP_1 99027 




A/G 


170056475 


199365 


DGOOAAJEK 


KCP 199365 


rs6555911 


A/G 


170056516 


199406 


SG05S164 


KCP 199406 


rs6887777 


A/T 


170056578 


199468 


SG05S1257 


KCP_1 99468 




C/T 


170057283 


200173 


SG05S165 


KCP 200173 




C/G 


170057351 


200241 


DGOOAAJEL 


KCP 200241 




A/G 


170057605 


200495 


SG05S976 


KCP 200495 




A/G 


170057933 


200823 


SG05S991 


KCP 200823 




A/C 


170058193 


201083 


SG05S992 


KCP 201083 


rs4464713 


C/T 


170058699 


201589 


SG05S885 


KCP_201589 




C/T 


170059095 


201985 


DGOOAAJEM 


KCP 201985 




G/A 


170059177 


202067 


DGOOAAJEN 


KCP 202067 


rs2221440 


A/G 


170059203 


202093 


SG05S977 


KCP_202093 




A/C 


170059905 


202795 


DGOOAAJEO 


KCP 202795 


rs875184 


C/T 


170060219 


203109 


SG05S1258 


KCP 203109 




A/G 


170060292 


203182 


SG05S978 


KCP 203182 




A/G 


170060393 


203283 


SG05S979 


KCP_203283 


rs905818 


A/G 


170061018 


203908 


SG05S980 


KCP_203908 


rs905817 


ZIT 


170061292 


204182 , 


3G05S981 


KCP 204182 i 


-S872435 


3/T 


170061352 


204242 ! 


3G05S166 


KCP 204242 ] 


-S6897344 < 


ZIT 



2345.2049-005 
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170061419 


204309 


SG05S982 


KCP_204309 




A/G 


170061618 


204508 


SG05S983 


KCP_204508 


rs872436 


A/G 


170061670 


204560 


SG05S1259 


KCP 204560 




A/G 


170061727 


204617 


SG05S984 


KCP_204617 


rs6876574 


C/T 


170061799 


204689 


SG05S1260 


KCP 204689 


rs905816 


G/T 


170061809 


204699 


SG05S1261 


KCP 204699 




A/T 


170061845 


204735 


SG05S1262 


KCP 204735 


rs905815 


C/T 


170062696 


205586 


SG05S886 


KCP 205586 


rs329466 


C/T 


170062747 


205637 


SG05S887 


KCP 205637 


rs7721804 


A/C 


170062756 


205646 


SG05S888 


KCP 205646 


rs329467 


C/T 


170062777 


205667 


SG05S889 


KCP 205667 


rs7721817 


A/G 


170062940 


205830 


SG05S167 


KCP 205830 




C/T 


170062950 


205840 


SG05S890 


KCP 205840 




A/G 


170063305 


206195 


SG05S891 


KCP_206195 




C/G 


170063313 


206203 


SG05S892 


KCP 206203 




C/T 


170063377 


206267 


SG05S168 


KCP 206267 




A/G 


170063732 


206622 


SG05S893 


KCP_206622 


rs7727631 


A/G 


170063817 


206707 


SG05S894 


KCP 206707 




C/T 


170063983 


206873 


SG05S1263 


KCP_206873 


rs7710016 


A/T 


170064013 


206903 


SG05S1264 


KCP_206903 




C/G 


170064648 


207538 


SG05S895 


KCP 207538 




C/T 


170064760 


207650 


SG05S969 


KCP_207650 




A/G 


170064771 


207661 


SG05S169 


KCP_207661 




C/G 


170064881 


207771 


SG05S896 


KCP 207771 


rs329468 


A/G 


170065075 


207965 


SG05S170 


KCP 207965 




C/T 


170065694 


208584 


SG05S171 


KCP 208584 




A/G 


170065711 


208601 


SG05S232 


KCP 208601 


rs329469 


A/C 


170065715 


208605 


SG05S897 


KCP 208605 




A/G 


170065740 


208630 


SG05S172 


KCP_208630 




C/T 


170065834 


208724 


SG05S1265 


KCP 208724 


rs7700434 


C/T 


170066123 


209013 


SG05S1266 


KCP 209013 


rs7734240 


C/T 


170066260 


209150 


SG05S1267 


KCP_209150 




A/G 


170067967 


210857 


SG05S898 


KCP 210857 


rs2194162 


A/G 


170068018 


210908 


SG05S899 


KCP 210908 


i 


C/G 


170068420 


211310 


SG05S900 


KCP 211310 


4 


MG 


170068510 


21 1400 


SG05S901 


KCP 211400 


rs4 10348 , 


MG 


170068614 


211504 


SG05S902 


KCP 211504 


i 


VG 


170068635 


211525 


SG05S173 


KCP 211525 


l 


VG 


170068731 


211621 


SG05S903 


KCP 211621 


< 


A/G 



2345.2049-005 
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170068759 


211649 


SG05S1268 


KCP_2 11649 




G/T 


170068960 


211850 


SG05S185 


KCP_211850 


rs329470 


C/T 


170069885 


212775 


SG05S186 


KCP 212775 


rs4349730 


A/G 


170070003 


212893 


SG05S1269 


KCP_212893 


rs6877532 


G/T 


170070041 


212931 


SG05S1270 


KCP 212931 


rs50057 


A/G 


170070593 


213483 


SG05S904 


KCP 213483 




A/G 


170070700 


213590 


SG05S1271 


KCP 213590 


rsl 02684 


C/T 


170070735 


213625 


SG05S905 


KCP_213625 


rsl02685 


C/T 


170070768 


213658 


SG05S1272 


KCP 213658 


rsl 02686 


A/G 


170071584 


214474 


SG05S1273 


KCP_214474 


rs329471 


C/G 


170071665 


214555 


SG05S1274 


KCP_214555 


rs329472 


C/T 


170071715 


214605 


SG05S1275 


KCP 214605 


rs329473 


C/G 


170072023 


214913 


SG05S1276 


KCP_214913 




A/G 


170072363 


215253 


SG05S906 


KCP_215253 


rs4041562 


C/T 


170072373 


215263 


SG05S907 


KCP_2 15263 


rsl 72944 


C/T 


170072484 


215374 


SG05S908 


KCP 215374 




A/G 


170072485 


215375 


SG05S909 


KCP_2 15375 




A/G 


170072562 


215452 


SG05S910 


KCP_215452 


rsl91297 


A/G 


170072712 


215602 


SG05S1277 


KCP_2 15602 


rsl 86646 


A/C 


170072813 


215703 


SG05S174 


KCP_2 15703 




A/C 


170073179 


216069 


SG05S1278 


KCP 216069 




C/T 


170073555 


216445 


SG05S1279 


KCP_216445 


rsl 363709 


A/G 


170073565 


216455 


SG05S1280 


KCP_2 16455 


rs329474 


C/G 


170074202 


217092 


SG05S993 


KCP_2 17092 


rs984559 


A/G 


170074303 


217193 


SG05S994 


KCP_217193 




C/T 


170074359 


217249 


SG05S995 


KCP_2 17249 


rs329475 


A/G 


170075932 


218822 


SG05S996 


KCP_218822 




A/G 


170076291 


219181 


SG05S997 


KCP 219181 




A/G 


170076439 


219329 


SG05S998 


KCP_219329 


rs801987 


C/G 


170077257 


220147 


SG05S911 


KCP_220147 




A/T 


170078779 


221669 


SG05S912 


KCP 221669 




C/G 


170078881 


221771 


SG05S1281 


KCP_221771 




C/T 


170078909 


221799 


DG00AAJHJ 


KCP 221799 


rs7733559 


A/T 


170078966 


221856 


SG05S913 


KCP_221856 


rs77 13498 


C/T 


170079102 


221992 


SG05S1282 


KCP_221992 


( 


C/T 


170079170 


222060 


SG05S175 


KCP_222060 


( 


C/T 


170079176 


222066 


SG05S1283 


KCP_222066 


i 


A/T 


170079986 


222876 


SG05S1284 


KCP_222876 


l 


VG 


170080026 


222916 


SG05S914 


ECCP_222916 


( 


:/t 
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170080378 


223268 


SG05S915 


KCP_223268 


rs4868017 


C/T 


170080480 


223370 


SG05S916 


KCP 223370 




C/T 


170080678 


223568 


SG05S917 


KCP_223568 




G/T 


170080917 


223807 


SG05S918 


KCP 223807 




C/G 


170081127 


224017 


SG05S919 


KCP 224017 


rs6555913 


A/G 


170081263 


224153 


SG05S1285 


KCP_224153 




G/T 


170081464 


224354 


SG05S920 


KCP_224354 




C/G 


170081779 


224669 


SG05S231 


KCP_224669 




A/C 


170082330 


225220 


SG05S177 


KCP_225220 




A/G 


170082361 


225251 


SG05S1286 


KCP_225251 




A/T 


170082496 


225386 


SG05S922 


KCP_225386 




C/T 


170083131 


226021 


SG05S1287 


KCP_226021 




A/C 


170083226 


226116 


SG05S1288 


KCP 226116 




C/G 


170083558 


226448 


SG05S924 


KCP_226448 




A/G 


170083941 


226831 


SG05S925 


KCP_226831 




A/G 


170084576 


227466 


SG05S926 


KCP_227466 




C/T 


170084823 


227713 


SG05S927 


KCP 227713 




A/G 


170084981 


227871 


SG05S178 


KCP 227871 




C/G 


170085097 


227987 


SG05S483 


KCP_227987 


rs2277951 


C/T 


170085116 


228006 


SG05S187 


KCP 228006 


rs2277952 


C/T 


170085151 


228041 


SG05S928 


KCP_228041 




A/T 


170085191 


228081 


SG05S929 


KCP 228081 




C/T 


170085217 


228107 


SG05S179 


KCP_228107 




A/T 


170085834 


228724 1 


SG05S1289 


KCP_228724 




A/G 


170086059 


228949 


SG05S999 


KCP 228949 




C/T 


170086143 


229033 


SG05S1000 


KCP_229033 




C/T 


170086250 


229140 


SG05S1001 


KCP_229140 




C/T 


170086709 


229599 


SG05S930 


KCP 229599 




A/C 


170086826 


229716 


SG05S931 


KCP 229716 




C/T 


170087721 


230611 


SG05S932 


KCP 230611 


rs6894038 


C/G 


170087734 


230624 


SG05S933 


KCP_230624 


rs6894316 


A/G 


170087780 


230670 


SG05S934 


KCP_230670 


rs6875006 


G/T 


170087950 


230840 


SG05S1290 


KCP 230840 




A/G 


170088932 


231822 


SG05S1291 


KCP 231822 


rsl422978 


C/T 


170089182 j 


232072 


SG05S1292 


KCP 232072 


rs2194160 


C/T 


170089631 


232521 


SG05S1293 


KCP_232521 


rsl592987 , 


KIT 


170090569 


233459 


SG05S935 


KCP 233459 


rs6870201 , 


VG 


170090765 


233655 


5G05S989 


K£P_233655 i 


•S2032863 , 


VG 


170091557 


234447 ! 


5G05S936 


KCP_234447 i 


•S6876375 i 


VG 
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170091681 


234571 


SG05S937 


KCP_234571 




C/T 


170091700 


234590 


SG05S938 


KCP_234590 




A/T 


170092075 


234965 


SG05S939 


KCP 234965 




C/T 


170092275 


235165 


SG05S940 


KCP 235165 


rsl363710 


G/T 


170092318 


235208 


SG05S941 


KCP_235208 


rs 13637 11 


A/G 


170092468 


235358 


SG05S942 


KCP 235358 




A/G 


170093047 


235937 


SG05S1294 


KCP 235937 




A/C 


170093362 


236252 


SG05S943 


KCP_236252 




A/T 


170094119 


237009 


SG05S1295 


KCP 237009 




A/G 


170094581 


237471 


SG05S944 


KCP 237471 


rsl422979 


A/G 


170094615 


237505 


SG05S188 


KCP 237505 


rs4867628 


C/T 


170094780 


237670 


SG05S1296 


KCP 237670 




G/T 


170095344 


238234 


SG05S945 


KCP_238234 




C/T 


170095662 


238552 


SG05S947 


KCP 238552 




C/T 


170095701 


238591 


SG05S180 


KCP 238591 




C/T 


170096774 


239664 


SG05S949 


KCP_239664 




C/G 


170097477 


240367 


SG05S950 


KCP_240367 


rs6879997 


C/G 


170098637 


241527 


SG05S190 


KCP_241527 


rs 13637 13 


G/T 


170098914 


241804 


SG05S191 


KCP 241804 


rsl055381 


C/T 


170099451 


242341 


SG05S951 


KCP 242341 


rsl363714 


A/G 


170099467 


242357 


SG05S952 


KCP 242357 


rs6872337 


G/T 


170106814 




SG05S1608 


SG05S1608 


rs 1544762 


G/T 


170106833 




SG05S1609 


SG05S1609 




C/T 


170106887 




SG05S1610 


SG05S1610 




A/C 



2345.2049-005 
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Table 13. The Build 33 location of SNPs and microsatellites employed for the 
subsequent association analysis across KChlPl. 



c*«-* mux 

start (i>33) 


Marker 


Public alias 


deCODE alias 


Variation 


1 AO/477QQA 

ioy4/ /OOO 


rsloyjJUl 


rslo953Ul 


SG05S2143 


C/T 


i /:q<aaq77 


•■o 1 /I777C7 

TS14ZZ /0Z 


1 a inci 
rSl4ZZ7jZ 


OPAfP 1 /-i / 

SG05S1616 


C/T 




TS14ZZ /04 


TS14ZZ7D4 


SG05S1617 


AJG 


IOVODj /Uo 


p\r^ c c 1 1 71 
LHj3M 1 Id 








loyooizuz 










ioyo/3!>iy 


CPA^C 077 


rso5ol73U 


SG05S872 


AJG 




oLiUDk>o /3 


w n7crvoA 
rsyZDUoU 


SG05S873 


A/G 


1 AQ/;0'2777 

ioyoy3 / /z 


JJIjjo4j 








loyoyoo / / 


KLr rs3 1 j / /3 


••ol 1 C7T3 

rs3 1 j 1 15 


SG05S76 


A/G 


1 £0707777 

loy /uz3 i i 










loy /uj duo 


ovjUjoo /o 


ml 1 C7C7 

rs3lD / j / 


SG05S876 


A/G 


ioy /uy / jo 


v-pp fnO<77/;7 

JvUr rsyjZ/o/ 


••oOC77/C7 

rsyjZ /o / 


CPACC7A 


G/T 


loy /4Uooo 


t/"\tt> 7/1777 


rs3 14155 


SG05S16H 


A/G 


1 cc\i a A7 ni 
loy /4U/U3 


KJNtf Z4Z59 




DGOOAAIGF 


A/G 


1 £07/1 1 1 77 

loy /4i i /z 


JviNr>_Z4/Zo 


rszo5oo4z 


DGOOAAIGG 


G/T 


loy /4j43o 


T\/*^ CO 1 1 70 








loy /4oiiy 


JsJNB_Zyo9j 




DGOOAAIGH 


C/T 


1 £07/170/1 1 

loy /4 /y4i 


T7-\yr) *}1At\H 

KJNB 31497 




DGOOAAIGI 


A/G 


loy /M /4Z 


ir\rn idfto 

JsJNB 3jZ9o 


rs2075612 


DGOOAAIGZ 


A/T 


1 £07^ 1 0 1 A 
loy /Mol4 


isJNB 3!>37U 




T\/~*f\i\ A A TTT A 

DGOOAAIHA 


C/G 


1 £Q7^ 1 QA1 
loy /jlo43 


lsJNB_3j3yy 


-n^AO CAO 

rs /03jUo 


DGOOAAIHB 


A //^ 

A/G 


loy Ojoou 


t^pD -p'l 1/11 in 

Ktr rs3141Zy 


_ n o 1/11 in 
rs3141Z9 


SG05S83 


C/T 


1 AQ7Q77A.1 

ioy /oZZU3 


Ktr rsl 0339© 


rslo339o 


PPArP 0*7 

SG05S87 


C/T 


ioy /oooyo 


DLOo4 / 








1 AQ70/1^77 

ioy /y4 jzz 


T\(~* CO 1 CQ7 

ixjjm !>yz 








169815996 


rs1 0^2856 




OVJU J070 


wijr 


169833941 


rs2055606 


rs2055606 


SG05S1621 


C/T 


169843903 


DG5S119 








169859275 


KCP rs888934 


rs888934 


SG05S93 


A/G 


169867465 


KCP 10355 




SG05S229 


A/T 


169867556 


KCP 10446 




DGOOAAHAR 


C/G 


169869845 


rs933656 


rs933656 


DGOOAAFCS 


A/G 


169869955 


rs2339091 


rs2339091 


DGOOAAFCI 


G/T 


169890996 


rsl862331 


rsl862331 


DGOOAAFCL 


C/T 


169895699 


KCP 38589 




SG05S953 


A/C 


169922309 


KCP 65199 




SG05S955 


A/G 



2345.2049-005 



-211- 



169939578 


KCP 82468 


rs4242158 


SG05S957 


A7G 

ru VJ 


169942903 


KCP 85793 


rs6892514 


SG05S958 


C/T 


169950655 


KCP 93545 




SG05S64 


G/T 

VJ/ 1 


169951970 


DG5S955 








169954954 


KCP 97844 


rs2202438 

X \*3 Am* Am* \y Am* 1*/ W 


DG00AAJTA 


A/T 


169959992 


KCP 102882 




DG00AAJTR 


r/T 


169961410 


DG5S13 








169964490 


KCP 107380 




DG00AATTC 

IJVJ w vrLTW Ivy 


A/G 

/x/ VJ 


169965813 


KCP 108703 




SG05S230 


G/T 

VJ/ 1 


169981987 


KCP 124877 

IfcVl X A* ■ vJ' / / 


rs4146511 

i. o r x r v/ x x 


SG05S194 


r/T 


169983196 


KCP 126086 


rs2202437 


SG05S195 


A/G 

tv/ vj 


169983318 


KCP 126208 




DG00AATHA 


T/C 

1 / v^ 


169986203 


KCP 129093 


rs4867995 


SG05S152 


p/g 


169986237 


KCP 129127 


rs4867619 


SG05S480 

kJ VJ \J -J U i O VJ 


A/G 

/xy VJ 


169986800 


KCP 129690 


rs4867996 


SG05S182 


G/T 

VJ/ 1 


169987419 


KCP 130309 




DG00AAJHB 


A/G 

rv vj 


169987667 


KCP 130557 


rs905822 


SG05S196 

vj VJ v/~/ kJ 1 7U 


r/G 

w vj 


169987873 


rs905823 


rs905823 


SG05Sn02 

kJvJV/JiJ 1 JV/z, 


A/r 


169988354 


KCP 131244 

x»*x»y.x x »y x a* • r 


rs905824 


SG05S197 

WJVJ V/^kJ 1 7 / 


A/G 

fx/ VJ 


169988964 


KCP 131854 

Xm.X->X X X \*J *J r 


rs6861734 


L7\JV JU 1 -^«J 


G/T 

VJ/ 1 


169989787 


KCP 132677 




SG05S154 

kJVJ V/^O 1 ,J*T 


A/G 

rv vj 


169991155 


KCP 134045 




WJVJ V/fc/Lj I/O 


r/T 

vw 1 


169992628 


KCP 135518 

*.*.vx X X V 


rs4868003 


SG05S200 


G/T 

vj/ 1 


169993146 


KCP 136036 




vjvJV/-/OtO 1 


A/r 


169994770 


KCP 137660 

-X X X / v/v/v/ 




SG0SS202 


A/G 

/x/ VJ 


170000722 


KCP 143612 

X XV/ X X T w/ V/ X ^* 


rs4867622 


SG05S66 


A/G 

/Ay vj 


170002070 


KCP 144960 


rs2279873 


SG05S203 


r/T 

vw 1 


170003856 


KCP 146746 




SG05S482 


r/T 

V_// 1 


170006326 


KCP 149216 


rs6555908 


SG05S205 


A/G 

/vy vj 


170006645 


KCP 149535 


rs883849 


SG05S206 

kJ VJ V/ kj ^ V/ \J 


A/G 

*V/ VJ 


170006645 


rs883849 


rs883849 


SG05S206 


A/G 

rV/ vjr 


170013842 


KCP 156732 

A J- X \J $ Am* 


rs924876 


SG05S207 


A/T 

£X7 X 


170015727 


KCP 158617 


rs2036559 


SG05S67 


C/T 


170015858 


DG5S123 








170017255 


KCP 160145 




SG05S208 


A/G 


170022007 


KCP 164897 




SG05S209 


A/G 


170026344 


KCP_1 69234 




SG05S156 


A/G 


170030958 


KCP 173848 




3G05S210 


A/G 


170031092 


KCP 173982 


rs6875696 


3G05S157 


A/C 


170031353 


KCP 174243 




DG00AAJHF 


A/G 



2345.2049-005 
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170032216 


KCP 175106 


rs2339095 

I D^J J7V/7 J 


SG05S1 


c/o 


170032280 


KCP 175170 

A *.V_y A A / y ± f \J 


r c6555910 

lOV7«/«7«/ J7 1 V7 


SG05<v?1 1 


A/n 


170032362 

1 1 \J \J *S X» *J \S 


KCP 175252 

AV.V-/A. A / y y 


r c7721722 


nnooA atptp 


A/n 

rVVJ 


170033946 

A » \J\J *J ~J S\\J 


KCP 176836 

AX.v^ A A / V/O JU 




DPOOA ATHTT 


A/n 


170037283 


KCP 180173 

AWA A OV7 L / y 


r<?2 135046 


uUUJj 1 J7 


P/T 


17fHtt77Jtt 

1 / \J\Jj / XOJ 








1 


170037347 


KCP 180237 

AW^A 1 OuLJ / 


fq7115047 


QP0S<^71 7 




170041 190 


KCP 184080 


i-q77Q7147 






170041996 


DG5S124 

A/VJ Jul 








170042689 


KCP 185579 




npoo a a rnv 


P/A 


170043158 


KCP 186048 




^00^^71 1 
ovji/joz. 1 J 


a /r; 


170043789 

1 / V/V/^-J 1 \jy 


KPP 186679 

ivv^a 1 OUU / y 




OUUJOlOl 


p/p 
uu 


170044226 


KCP 187116 

AVA_/A AO/ AAV/ 




nnooA a rnv 


A/P 


170044368 


KCP 187258 

A^A^> A AO / LJO 




cr;0S^1 67 

OVJU Jul OZ 


P/T 


170044798 

A / \J\J T 1 / y Kj 


KCP 187688 




nnooA A TF>7 


T/A 


170046441 

A / \J\J~\J J T A 


KCP 189331 




QH0SS171 4 

OvJUJoZ 1*+ 


A/P 
/Vvjr 


170049852 


KCP 192742 


r*1 971599 


nnooA a rPR 


T/P 


170050303 

A / \J\J *J\J ^J\J ~J 


KCP 193193 

avv>a i yy i yy 




Dt^OOA A TFP 


P/A 


170051066 


KCP 193956 


i-q7707440 




P/T 


170051726 


KCP 194616 


r o7036560 


nnoo a a n?Pt 


T/P 


170053658 

A / \J\J \J *J\J+J\J 


KCP 196548 




nnooA a tp v 


A /P 


1 70054788 


KCP 19767R 


r<:Q67R04 


nnoo a a JT^fir 


T/P 


1700S48RS 


I^PP 1 0777^ 




r\nr\n a a tt? u \ 
Ukjkjxj AAJ rirl 


U/ 1 


170056043 

A / \J\J -J \J\J~ ~J 


KCP 198911 




"nr^ooA a tf t 


P/A 


170056475 


KCP 19Q16S 






A in 


170056955 

A / \J\J y \J y ■y y 


rs2339119 

1 oZ. J y y 1 y y 


r<37llQ1 1Q 


FiHOOA Aprp 


A /P 


170057351 

A 1 \J\J y 1 y y x. 


CCP 200241 




nnooA A TFT 


A/P 
/AY W 


170059095 

A / \J\jyy\jyy 


KCP 201985 




nnoo a a IT? A/r 


P/A 


170059177 


KCP 202067 


r<;222 1 440 


nnooA a tfist 


A/P 


170059905 


KCP 202795 


r<;875184 

ADO / y 1 OT 


nnooA ATFn 


P/T 


170061292 


rs872435 


rs872435 i 

LOO / X." J J 


Q 005^981 


P/T 


170061352 


KCP 204242 


rs6897144 

LOU07 / J*T*T 


<sO05<s166 


P/T 


170063377 


KCP_206267 




SG05S168 


A/G 


170064771 


KCP 207661 




SG05S169 


C/G 


170064881 


rs329468 


rs329468 


SG05S896 


A/G 


170065075 


KCP 207965 




SG05S170 


C/T 


170068635 


KCP 211525 




SG05S173 


A/G 


170068960 


KCP 211850 


rs329470 


5G05S185 < 


Z/T 


170069885 


KCP 212775 


rs4349730 


3G05S186 


\JG 


170070041 


rs50057 


rs50057 


5G05S1270 





2345.2049-005 
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170073252 


rs50364 


rs50364 


dgooaafcd 


A/ft 

rV/VJ 


170078909 


KCP 221799 


rs7733559 




A/T 


170080678 

X # WWW / *J 


KCP 223568 




QO05991 7 


VJ/ I 


170081292 


KCP 1152 






p/r 


170081473 


KCP 1333 




^005^991 


A /ft 


170082330 


KCP 225220 




OVjUJOl / / 


/VvJ 


170082789 








err 


170084981 

X. 1 vVOT/Ol 


KCP 227871 




QPfKQI 78 


p/p^ 


170085097 


KCP 227987 


r<2??77951 




P/T 


170085115 


KCP 4976 




oVJvJOl O / 


P/T 


170085217 


KCP 228107 




^005^1 7Q 


A/T 


170085217 


KCP 5077 




QPfKQI 7Q 


A IT 


170089631 


KCP 232521 


r o1 <sQ?Q87 




A/T 


170090765 


KCP 233655 


rs2032863 


^005^989 


A /ft 


170094615 


KCP 237505 


rs4867628 


QG05S188 


P/T 


170095540 


KCP 15400 




^nO^^Qdfi 


P/T 


170095701 

1 l \J\J S +J 1 \J 1. 


KCP 238591 






P/T 


1 70096292 






QPft^QQ/ift 
&VjUj&y'*o 


A /P 
A/Lr 


170098209 


KCP 241099 


rsl363712 


SG05S189 


C/T 


170098209 


KCP 18069 


rsl363712 


SG05S189 


C/T 


170098637 


KCP 241527 


rsl363713 


SG05S190 


G/T 


170098914 


KCP 241804 


rsl055381 


SG05S191 


C/T 


170105556 


D5S625 








170167429 


DG5S959 








170361737 


rsl551583 


rsl551583 


SG05S1619 


C/G 


170389497 


rsl457692 


rsl457692 


SG05S1618 


A/G 



.2049-005 
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In order to define SNP-only haplotypes, 66 SNPs (Bold entries in Tables 12 
and 13) were further genotyped totalling 948 diabetic patients (538 with BMK30; 410 
with BMI>=30) and 570 controls across 600kb of KChlPl, of which 58 were 
concentrated in the 231kb region encompassing exon lb, the large intron (where Hap 
Dl resides) through to exon 8. The most significant 7-SNP haplotype (Hap E- see 
Table 14) observed in non-obese T2D (p = 1 .33x10"*) is significantly correlated with 
Dl (D' = 0.76 between Hap Dl and Hap E) and captures approximately 75% of the 
chromosomes that carry Hap Dl . The relative risk of this 280kb haplotype for all 
diabetes patients is 1.77, with a carrier frequency of 40%. 

Hap E can be made more specific by adding more SNPs, e.g. by adding 
DG00AAJEH, the relative risk increases to 2.28 in all diabetic patients vs controls. 
This variant of Hap E, which we denote Hap E\ has a carrier frequency of 20.1% in 
all diabetes patients and population attributable risk (PAR) = 12.3% 
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Table 14: Alleles contained within Hap E and Hap E' 



Haplotype 


Length (kb) 


































HapE 


280 


G 


rs1032856 


G 


KCP rs888934 


T 


KCP 93545 


C 


KCP 102882 


G 


KCP 169234 


G 


KCP 186048 






A 


KCP 16152 


HapE* 


280 


G 


rs1 032856 


G 


KCP rs888934 


T 


KCP 93545 


C 


KCP 102882 


G 


KCP 169234 


G 


KCP 186048 


C 


KCP 197775 


A 


KCP 16152 



20 





p-val 


r 


#aff 


aff.freq 


aff.freq (carr) 


#con 


con.freq 


con.freq (carr) 


info 


T2D BMK30 


1.33E-06 


1.929 


525 


0.2549 


0.379841195 


527 


0.1506 


0.255828099 


0.654 


T2D BMO30 


0.015813 


1.451 


387 


0.1959 


0.315039082 


527 


0.1437 


0.24617045 


0.670 


T2D All 


5.04E-06 


1.769 


912 


0.2270 


0.350961655 


527 


0.1424 


0.244220163 


0.661 


T2D Males 


1.56E-05 


1.831 


526 


0.2337 


0.358157968 


527 


0.1428 


0.244772025 


0.650 


T2D Females 


0.001484 


1.623 


386 


0.2187 


0.34178225 


527 


0.1471 


0.250938707 


0.651 





p-val 


r 


#aff 


aff.freq 


aff.freq (carr) 


#con 


con.freq 


con.freq (carr) 


info 


T2D BMK30 


0.000185 


2.248 


518 


0.1229 


0.215573079 


453 


0.0587 


0.110434655 


0.571 


T2D BMI>30 


0.015423 


1.896 


379 


0.0966 


0.174484759 


453 


0.0534 


0.101016667 


0.517 


T2D All 


0.000105 


2.279 


897 


0.1130 


0.20051463 


453 


0.0530 


0.100301178 


0.535 


T2D Males 


0.000551 


2.243 


517 


0.1098 


0.195525378 


453 


0.0521 


0.098826539 


0.545 


T2D Females 


0.004482 


1.976 


380 


0.1101 


0.195946622 


453 


0.0589 


0.110893515 


0.564 
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The teachings of all publications cited herein are incorporated herein by 
reference in their entirety. While this invention has been particularly shown and 
described with references to preferred embodiments thereof, it will be understood by 
5 those skilled in the art that various changes in form and details may be made therein 
without departing from the scope of the invention encompassed by the appended 
claims. 



